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Abstract 



> 

CO 

#y\ Many evolution problems in physics are described by partial differential equations on an 

infinite domain; therefore, one is interested in the solutions to such problems for a given initial 
£N| dataset. A prominent example is the binary black hole problem within Einstein's theory of 

7—i gravitation, in which one computes the gravitational radiation emitted from the inspiral of the 

two black holes, merger and ringdown. Powerful mathematical tools can be used to establish 
. — qualitative statements about the solutions, such as their existence, uniqueness, continuous 

j^ dependence on the initial data, or their asymptotic behavior over large time scales. However, 

one is often interested in computing the solution itself, and unless the partial differential equa- 
tion is very simple, or the initial data possesses a high degree of symmetry, this computation 
requires approximation by numerical discretization. When solving such discrete problems on 
a machine, one is faced with a finite limit to computational resources, which leads to the 
replacement of the infinite continuum domain with a finite computer grid. This, in turn, leads 
to a discrete initial-boundary value problem. The hope is to recover, with high accuracy, the 
exact solution in the limit where the grid spacing converges to zero with the boundary being 
pushed to infinity. 

The goal of this article is to review some of the theory necessary to understand the contin- 
uum and discrete initial-boundary value problems arising from hyperbolic partial differential 
equations and to discuss its applications to numerical relativity; in particular, we present well- 
posed initial and initial-boundary value formulations of Einstein's equations, and we discuss 
multi-domain high-order finite difference and spectral methods to solve them. 
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1 Introduction 

This review discusses fundamental tools from the analytical and numerical theory underlying the 
Einstein field equations as an evolution problem on a finite computational domain. The process 
of reaching the current status of numerical relativity after decades of effort not only has driven 
the community to use state of the art techniques but also to extend and work out new approaches 
and methodologies of its own. This review discusses some of the theory involved in setting up 
the problem and numerical approaches for solving it. Its scope is rather broad: it ranges from 
analytical aspects related to the well posedness of the Cauchy problem to numerical discretization 
schemes guaranteeing stability and convergence to the exact solution. 

At the continuum, emphasis is placed on setting up the initial-boundary value problem (IBVP) 
for Einstein's equations properly, by which we mean obtaining a well posed formulation which is 
flexible enough to incorporate coordinate conditions which allow for long-term and accurate stable 
numerical evolutions. Here, the well posedness property is essential in that it guarantees existence 
of a unique solution which depends continuously on the initial and boundary data. In particular, 
this assures that small perturbations in the data do not get arbitrarily amplified. Since such small 
perturbations do appear in numerical simulations because of discretization errors or finite machine 
precision, if such unbounded growth were allowed, the numerical solution would not converge to 
the exact one as resolution is increased. This picture is at the core of Lax' historical theorem, which 
implies that the consistency of the numerical scheme is not sufficient for its solution to converge 
to the exact one. Instead, the scheme also needs to be numerically stable, a property which is the 
discrete counterpart of well posedness of the continuum problem. 

While the well posedness of the Cauchy problem in General Relativity in the absence of bound- 
aries was established a long time ago, only relatively recently has the IBVP been addressed and 
well posed problems formulated. This is mainly due to the fact that the IBVP presents several 
new challenges, related to constraint preservation, the minimization of spurious reflections, and 
well posedness. In fact, it is only very recently that such a well posed problem has been found for 
a metric based formulation used in numerical relativity, and there are still open issues that need 
to be sorted out. It is interesting to point out that the IBVP in General Relativity has driven 
research which has led to well posedness results for second order systems with a new large class of 
boundary conditions which, in addition to Einstein's equations, are also applicable to Maxwell's 
equations in their potential formulation. 

At the discrete level, the focus of this review is mainly on designing numerical schemes for 
which fast convergence to the exact solution is guaranteed. Unfortunately, no or very few general 
results are known for nonlinear equations and, therefore, we concentrate on schemes for which 
stability and convergence can be shown at the linear level, at least. If the exact solution is 
smooth, as expected for vacuum solutions of Einstein's field equations with smooth initial data 
and appropriate gauge conditions, at least as long a no curvature singularities form, it is not 
unreasonable to expect that schemes guaranteeing stability at the linearized level, perhaps with 
some additional filtering, are also stable for the nonlinear problem. Furthermore, since the solutions 
are expected to be smooth, emphasis is here placed on using fast converging space discretizations, 
such as high order finite difference or spectral methods, especially those which can be applied to 
multi-domain implementations. 

The organization of this review is at follows. Section [3] starts with a discussion of well posedness 
for initial value problems for evolution problems in general, with special emphasis on hyperbolic 
ones, including their algebraic characterization. Next, in Section HI we review some formulations 
of Einstein's equations which yield a well posed initial value problem. Here, we mainly focus 
on the harmonic and BSSN formulations which are the two most widely used ones in numerical 
relativity, as well as the ADM formulation with different gauge conditions. Actual numerical 
simulations always involve the presence of computational boundaries, which raises the need of 



analyzing well posedness of the IBVP. For this reason, the theory of IBVP for hyperbolic problems 
is reviewed in Section [5] followed by a presentation of the state of the art of boundary conditions 
for the harmonic and BSSN formulations of Einstein's equations in Section[6j where open problems 
related with gauge uniqueness are also described. 

Section [7] reviews some of the numerical stability theory, including necessary eigenvalue con- 
ditions. These are quite useful in practice for analyzing complicated systems or discretizations. 
We also discuss necessary and sufficient conditions for stability within the method of lines, and 
Runge-Kutta methods. Sections [8] and [9] are devoted to two classes of spatial approximations: 
finite differences and spectral methods. Finite differences arc rather standard and widespread, so 
in Section [8] we mostly focus on the construction of optimized operators of arbitrary high order 
satisfying the Summation By Parts property, which is useful in stability analyses. We also briefly 
mention classical polynomial interpolation and how to systematically construct finite difference 
operators from it. In Section [9] we present the main elements and theory of spectral methods, 
including spectral convergence from solutions to Sturm-Liouville problems, expansions in orthog- 
onal polynomials, Gauss quadratures, spectral differentiation, and spectral viscosity. We present 
several explicit formulae for the families of polynomials most widely used: Legendre and Cheby- 
shev. Section [TO] describes boundary closures. In the present context they refer to procedures for 
imposing boundary conditions leading to stability results. We emphasize the penalty technique, 
which applies to both finite difference methods of arbitrary high order and spectral ones, as well 
as outer and interface boundaries, such as those appearing when there are multiple grids as in 
complex geometries domain decompositions. We also discuss absorbing boundary conditions for 
Einstein's equations. Finally, Section [TT| presents a random sample of approaches in numerical rel- 
ativity using multiple, semi-structured grids, and/or curvilinear coordinates. In particular, some 
of these examples illustrate many of the methods discussed in this review in realistic simulations. 

There are many topics related to numerical relativity which are not covered by this review. It 
does not include discussions of physical results in General Relativity obtained through numerical 
simulations, such as critical phenomena or gravitational waveforms computed from binary black 
hole mergers. For reviews on these topics we refer the reader to |2I5j and 326, IT91 . respectively. 
See also Refs. [7J [H] for recent books on numerical relativity. Next, we do not discuss setting up 
initial data and solving the Einstein constraints, and refer to [130] . For reviews on the characteristic 



and conformal approach, which are only briefly mentioned in Section 6.4 we refer the reader to 
and 168 , respectively. Most of the results specific to Einstein's field equations in Sections E and 
[6] apply to four-dimensional gravity only, though it should be possible to generalize some of them 
to higher-dimensional theories. Also, as we have already mentioned, the results described here 
mostly apply to the vacuum field equations, in which case the solutions are expected to be smooth. 
For aspects involving the presence of shocks, such as those present in relativistic hydrodynamics 
we refer the reader to Refs. [160, 287J. Finally, see |342) for a more detailed review on hyperbolic 
formulations of Einstein's equations, and |339] for one on global existence theorems in General 
Relativity. Spectral methods in numerical relativity are discussed in detail in [209] . The 3 + 1 
approach to General Relativity is thoroughly reviewed in |208j . Finally, we refer the reader to [123] 
for a recent book on General Relativity and the Einstein equations which, among many other 
topics, discusses local and global aspects of the Cauchy problem, the constraint equations, and 
self-gravitating matter fields such as relativistic fluids and the relativistic kinetic theory of gases. 

Except for a few historical remarks, this review does not discuss much of the historical path to 
the techniques and tools presented, but rather describes the state of the art of a subset of those 
which appear to be useful. Our choice of topics is mostly influenced by those for which some 
analysis is available or possible. 

We have tried to make each section as self-consistent as possible within the scope of a man- 
ageable review, so that they can be read separately, though each of them builds from the previous 
ones. Numerous examples are included. 



2 Notation and Conventions 

Throughout this article, we use the following notation and conventions. For a complex vector 
u G C m , we denote by u* its transposed, complex conjugate, such that u ■ v :— u*v is the standard 
scalar product for two vectors u,v G C m . The corresponding norm is defined by |u| := y/u*u. The 
norm of a complex, m x k matrix A is 

\A\ := sup i— pL. 

«GC fc \{0} \ u \ 

The transposed, complex conjugate of A is denoted by A*, such that v ■ (Au) — (A*v) ■ u for all 
u € C k and v G C"\ For two Hermitian m x m matrices A — A* and B = B* , the inequality 
A < B means u ■ Au < u ■ Bu for all u £ C". The identity matrix is denoted by I. 
The spectrum of a complex, m x m matrix A is the set of all eigenvalues of A, 

a(A) := {A e C : XI - A is not invertible }, 

which is real for Hermitian matrices. The spectral radius of A is defined as 

p{A) :=max{|A| : A G cr(A)}. 

Then, the matrix norm \B\ of a complex mx k matrix B can also be computed as \B\ = y/p(B*B). 
Next, we denote by L 2 (U) the class of measurable functions / : U C W 1 — s- C m on the open 
subset U of R™ which are square- integrable. Two functions /, g G L 2 (U) which differ from each 
other only by a set of measure zero are identified. The scalar product on L 2 (U) is defined as 



(f,g):= J f(x)*g(x)d n x, f,geL 2 (U), 

u 

and the corresponding norm is |j/|| := y/(f, /). According to the Cauchy-Schwarz inequality we 
have 

(/,<?>< U/ll IMI, LgeL 2 (U). (1) 

The Fourier transform of a function / belonging to the class Cq°(R") of infinitely differentiable 
functions with compact support is defined as 

f(k) ■= j^j- 2 J e- ik -*f(x)d n x, fceR". (2) 

According to Parseval's identities, (f,g) = (f,g) for all f,g e C£°(R n ), and the map C^(R n ) -> 
L 2 (R n ), f i-> / can be extended to a linear, unitary map T : L 2 (R n ) — > L 2 (R n ) called the Fourier- 
Plancharel operator, see for example Ref. [335J. Its inverse is given by F^ 1 (f)(x) = f(—x) for 
/ G L 2 {W l ) and x eW l . 

For a differentiable function u, we denote by Ut, u x , u y , u z . its partial derivatives with respect 
to t, x, y, z. 

Indices labeling gridpoints and number of basis functions range from to N. Superscripts and 
subscripts are used to denote the numerical solution at some discrete timestep and gridpoint, as in 

v* := v(t k ,Xj). 

We use boldface fonts for gridfunctions, as in 

v fe :={ V (t,,x,)}f =0 . 



3 The Initial Value Problem 

We start here with a discussion of hyperbolic evolution problems on the infinite domain K" . This 
is usually the situation one encounters in the mathematical description of isolated systems, where 
some strong field phenomena takes place "near the origin" and generates waves which are emit- 
ted toward "infinity" . Therefore, the goal of this section is to analyze the well-posedness of the 
Cauchy problem for quasilinear hyperbolic evolution equations without boundaries. The case with 
boundaries is the subject of the Section [5j As mentioned in the introduction, the well-posedness 
results are fundamental in the sense that they give existence (at least local in time if the problem 
is nonlinear) and uniqueness of solutions and show that these depend continuously on the initial 
data. Of course, how the solution actually looks like in detail needs to be established by more 
sophisticated mathematical tools or by numerical experiments, but it is clear that it does not make 
sense to speak about "the solution" if the problem is not well posed. 

Our presentation starts with the simplest case of linear constant coefficient problems in Sec- 
tion [XT] where solutions can be constructed explicitly using Fourier transform. Then, we consider 



in Section 3.2 linear problems with variable coefficients which we reduce to the constant coefficient 



case using the localization principle. Next, in Section 3.3 we treat first order quasilinear equations 
which we reduce to the previous case by the principle of linearization. Finally, in Section [3~4| we 
summarize some basic results about abstract evolution operators, which give the general frame- 
work for treating evolution problems including not only those described by local partial differential 
operators, but also more general ones. 

Much of the material from the first three subsections is taken from the book by Kreiss and 
Lorenz [253j . However, our summary also includes recent results concerning second order equations, 
examples of wave systems on curved spacetimes, and a very brief review of semigroup theory. 

3.1 Linear, constant coefficient problems 

We consider an evolution equation on n-dimensional space of the following form: 

ut = P(d/dx)u = Y^ A v D v u, i£l", t > 0. (3) 

Here, u = u(t,x) £ C m is the state vector, and u t its partial derivative with respect to t. Next, 
the A^'s denote complex, m x m matrices where v — (i/i, ^2, • • ■ , v n ) denotes a multi-index with 
components Vj 6 {0, 1, 2, 3, . . .} and \v\ := V\ + . . . + v n . Finally, D v denotes the partial derivative 
operator 

dx V i ■ ■ ■ dxn n 

of order \v\, where Dq := /. Here are a few representative examples: 

Example 1. The advection equation u t (t,x) — Xu x (t,x) with speed A € R in the negative x 
direction. 

Example 2. The heat equation u t (t,x) — Au(t,x), where 

~* dx\ dx\ dx^ 

denotes the Laplace operator. 

Example 3. The Schrddinger equation u t (t,x) = iAu(t,x). 



Example 4. The wave equation U tt = At/, which can be cast into the form of Equation (pi), 



1 \ ( U 



"i i A j ». - y v 

We can find solutions of Equation pi) by Fourier transformation in space, 

u(t,k) = —^je- ik - x u(t,x)d n x, keR n , t > 0. 

Applied to Equation ([3]) this yields the system of linear ordinary differential equations 

u t = P(ik)u, t > 0, (4) 

for each wave vector k £ M™ where P(ik), called the symbol of the differential operator P(d/dx), 
is defined as 

P(ik) := J2 MihT 1 ■■■{ik n ) l/ ", k£R n . (5) 

\"\<P 

The solution of Equation Q is given by 

u(t,k) = e p(lfc) *u(0,fc), i>0, 

where w(0, fc) is determined by the initial data for u at t = 0. Therefore, the formal solution of the 
Cauchy problem 

u t (t,x) =P(d/dx)u(t,x), iet", i>0, (6) 

u(0,x) = f(x), xel", (7) 

with given initial data / for u at t = is 

u(t,x) = —^—Je* k -*e p ^ t f(k)d n k, xel", <>0, (8) 

where /(fc) = j^ J e^ k - f(x)d n x. 

3.1.1 Well-posedness 

At this point, however, we have to ask ourselves if the expression QSp makes sense. In fact, we do 
not expect the integral to converge in general. Even if / is smooth and decays to zero as |fc| — > oo 
we could still have problems if |e p ( lfc )*| diverges as |fc| — >• oo. One simple, but very restrictive, 
possibility to control this problem is to limit ourselves to initial data / in the class <S W of functions 
which are the Fourier transform of a C°°-function with compact support, i.e. / £ 5 W , where 

S- := | V (.) = j^^ J e* k <\(k)d n k : v £ C °°(IR")| . 

A function in this space is real analytic and decays faster than any polynomial as |a;| — > ooF] 
If / £ S^ the integral in Equation (p| is well-defined and we obtain a solution of the Cauchy 
problem (l6j[7| which, for each t > lies in this space. However, this possibility suffers from several 
unwanted features: 

1 More precisely, it follows from the Paley- Wiener theorem (see Theorem IX. 11 in Ref. [335]) that / £ S" with 
f(k) = for |fe| > R if and only if / possesses an analytic extension / : C" — > C such that for each N = 0, 1, 2, . . . 
there exists a constant Cjv with 

H|Im(C) 

1/(01 - c "(ITicF' CGC "- 
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• The space of admissible initial data is very restrictive. Indeed, since / G 5 W is necessarily 
analytic it is not possible to consider nontrivial data with, say, compact support and study 
the propagation of the support for such data. 

• For fixed t > 0, the solution may grow without bound when perturbations with arbitrarily 
small amplitude but higher and higher frequency components are considered. Such an effect 
is illustrated in Example [6] below. 

• The function space S" does not seem to be useful as a solution space when considering linear 
variable coefficient or quasilinear problems, since for such problems the different k modes do 
not decouple from each other. Hence, mode coupling can lead to components with arbitrarily 
high frequencies r\ 

For these reasons, it is desirable to consider initial data of a more general class than S^ . For this, 
we need to control the growth of e p ^ lk ' t . This is captured in the following 

Definition 1. The Cauchy problem M f?p is called well posed if there are constants K > 1 and 
q61 such that 

| e P(ife)t| < Ke at for allt>0 and allkeW 1 . (9) 

The importance of this definition relies in the property that for each fixed time t > the norm 
| e -P(* fc )*| f the propagator is bounded by the constant C(t) := Ke at which is independent of the 
wave vector k. The definition does not state anything about the growth of the solution with time 
other that this growth is bounded by an exponential. In this sense, unless one can choose a < or 
a > arbitrarily small, well posedness is not a statement about the stability in time, but rather 
about stability with respect to mode fluctuations. 

Let us illustrate the meaning of Definition [T] with a few examples: 

Example 5. The heat equation u t (t,x) — Au(t,x). 

Fourier transformation converts this equation into Ut(t,k) = — \k\ 2 u(t, k). Hence, the symbol is 

P(ik) = -\k\ 2 and \ e Pi - ik ^\ = e^ 1 < 1. The problem is well posed. 

Example 6. The backwards heat equation ut{t,x) = —Au(t,x). 

In this case the symbol is P(ik) — +|fc| 2 , and |e p ( ifc )*| = e' fc ' *. In contrast to the previous case, 
e P{ik)t exhibits exponential frequency-dependent growth for each fixed t > and the problem is 
not well posed. Notice that small initial perturbations with large |fc| are amplified by a factor that 
becomes larger and larger as |fc| increases. Therefore, after an arbitrarily small time, the solution 
is contaminated by high-frequency modes. 

Example 7. The Schrodinger equation ut{t,x) = iAu(t,x). 

In this case we have P(ik) = i\k\ 2 and |e p ( i ' c ) t | = 1. The problem is well posed. Furthermore, the 
evolution is unitary, and we can evolve forward and backwards in time. When compared to the 
previous example, it is the factor i in front of the Laplace operator that saves the situation and 
allows the evolution backwards in time. 

Example 8. The one-dimensional wave equation written in first order form, 

u t (t,x) = Au x (t, x), A=( 1 

2 In this regard we should mention the Cauchy-Kovalevskaya theorem (see, for example Ref. [156p which always 
provides unique local analytic solutions to the Cauchy problem for quasilinear partial differential equations with 
analytic coefficients and data on a non-characteristic surface. However, this theorem does not say anything about 
causal propagation and stability with respect to high-frequency perturbations. 
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The symbol is P{ik) = ikA. Since the matrix A is symmetric and has eigenvalues ±1, there exists 
an orthogonal transformation U such that 

Therefore, |e p ^ fc '*| = 1, and the problem is well posed. 

Example 9. Perturb the previous problem by a lower order term, 

u t (t,x) =Au x (t,x) + Xu(t,x), A=( ), Ael. 

The symbol is P(ik) — ikA + XI, and |e p ^ fe ^*| = e A *. The problem is well posed, even though the 
solution grows exponentially in time if A > 0. 

More generally one can show, see Theorem 2.1.2 in Rcf. [253 1 : 

Lemma 1. The Cauchy problem for the first order equation Ut = Au x + B with complex m x m 
matrices A and B is well posed if and only if A is diagonalizable and has only real eigenvalues. 

By considering the eigenvalues of the symbol P{ik) we obtain the following simple necessary 
condition for well posedness: 

Lemma 2 (Petrovskii condition). Suppose the Cauchy problem MM) is well posed. Then, there 
is a constant a€l such that 

Re{X) < a (10) 

for all eigenvalues X of P(ik). 

Proof. Suppose A is an eigenvalue of P(ik) with corresponding eigenvector v, P(ik)v = Xv. Then, 
if the problem is well posed, 

Ke at \v\ > |e p ^*t;| = \e xt v\ = e Re(A) *M, 

for all t > and all k € R n which implies that e Re ( A )* < Ke at for all t > 0, and hence Re(A) < 

a. a 

Although the Petrovskii condition is a very simple necessary condition, we stress that it is not 

sufficient in general. Counterexamples are first order systems which are weakly but not strongly 



hyperbolic, see Example 10 below. 



3.1.2 Extension of solutions 

Now that we have defined and illustrated the notion of well-posedness, let us see how it can be 
used to solve the Cauchy problem (|6j [7| for initial data more general than in 5" . Suppose first 
that / G S u , as before. Then, if the problem is well posed, Parseval's identities imply that the 
solution pi must satisfy 

|K*, .)|| = \\u(t, .)|| - ||e p W*/ll < Ke at \\f\\ = Ke at \\f\\, t > 0. 
Therefore, the 5^-solution satisfies the following estimate 

|Kt,.)||<tfe at ||/|| ) <>0, (11) 
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for all / £ S u . This estimate is important because it allows us to extend the solution to the much 
larger space L 2 (W l ). This extension is defined in the following way: let / G L 2 (R n ). Since S u 
is dense in L 2 (U. n ) there exists a sequence {fj} in S" such that \\fj — f\\ —> 0. Therefore, if the 



problem is well posed, it follows from the estimate ( 11 ) that the corresponding solutions Uj defined 
by Equation ([8]) form a Cauchy-sequence in L 2 (IR n ), and we can define 

U{t)f{x) := lim — -i^ ( e ifc -e p ( ifc ) t / i (fc)d"fc, x£l", t > 0, (12) 

where the limit exists in the L 2 (R n ) sense. The linear map U(t) : L 2 (R n ) -> L 2 (R n ) satisfies the 
following properties: 

(i) U(0) — I is the identity map. 

(ii) U(t + s) = U(t)U(s) for all t, s > 0. 

(iii) For / e S w , u(t, .) = U(t)f is the unique solution to the Cauchy problem (TgI [TJ) . 

(iv) ||J7(t)/|| < Ke at \\f\\ for all / e L 2 (R n ) and all t > 0. 

The family {U(t) : t > 0} is called a semi-group on L 2 (M. n ). In general, U(t) cannot be extended 
to negative t as the example of the backwards heat equation, Example [6l shows. 

For / e L 2 (W l ) the function u(t,x) :— U(t)f(x) is called a weak solution of the Cauchy 
problem (pi |7p. It can also be constructed in an abstract way by using the Fourier-Plancharel 
operator T : L 2 {W l ) -> L 2 (R n ). If the problem is well posed, then for each / e L 2 (R n ) and t > 
the map k H> e P( - lfc -'*J r (/)(fc) defines an L 2 (K n )-function, and hence we can define 



«(*, •) := ^ _1 (e P(l)t -F/) , t > 



0. 



According to Duhamel's principle, the semi-group U(i) can also be used to construct weak 
solutions of the inhomogeneous problem, 

ut(t,x) = P{d/dx)u{t,x) + F(t,x), iel", t>0, (13) 

u(0,x)=f(x), xeK", (14) 

where F : [0, oo) -> L 2 (R n ), t ^ F(t, ■) is continuous: 

u(t, •) = U(t)f + fu(t- s)F{s, -)ds. (15) 



For a discussion on semi-groups in a more general context see Section [3.4| below. 

3.1.3 Algebraic characterization 

In order to extend the solution concept to initial data more general than analytic one, we have 
introduced the concept of well-posedness in Definition [I] However, given a symbol P(ik), it is not 
always a simple task to determine whether or not constants K > and a € R exist such that 
| e ,p(ifc)*| < Ke at for alH > and k G W 1 . Fortunately, the matrix theorem by Kreiss 251 provides 
necessary and sufficient conditions on the symbol P(ik) for well-posedness. 

Theorem 1. Let P(ik), k S W 1 , be the symbol of a constant coefficient linear problem, see Equa- 
tion |5|) ; and let a£l. Then, the following conditions are equivalent: 
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(i) There exists a constant K > such that 

\e p{lk)t \<Ke at (16) 

for allt>0 andke R n . 

(ii) There exists a constant M > and a family H{k) of m x m Hermitian matrices such that 

M' 1 ! < H(k) < MI, H(k)P(ik) + P(ik)*H(k) < 2aH(k) (17) 

for allkeW 1 . 

A generalization and complete proof of this theorem can be found in Ref. |253j . However, let us 
show here the implication (ii) =4> (i) since it illustrates the concept of energy estimates which will 
be used quite often throughout this review (see Section 3.2. 3| below for a more general discussion 



of these estimates). Hence, let H(k) be a family of m x m Hermitian matrices satisfying the 



condition (|17|). Let k £ R" and v £ C m be fixed, and define v(t) := e p (' fc )*v for t > 0. Then, we 
estimate for the "energy" density v(t)*H(k)v(t), 

v(t)*H(k)v(t) = [P(ik)v(t)]*H(k)v(t)+v(t)*H(k)P(ik)v{t) 



have the following estimate for the "energy" density v(t)*H(k)v(t), 

d_ 

dt 

= v(t)* [P(ik)*H(k) + H(k)P(ik)} v(t) 

< 2av(t)*H(k)v(t), 
which implies the differential inequality 

j[e- 2at v{t)*H{k)v{t)] <0, * > 0, k£R n . 
Integrating, we find 

M->(t)| 2 < v(t)*H{k)v(t) < e 2at v*H(k)v < Me 2at \v \ 2 , 



which implies the inequality ( 16 ) with K = M 



3.1.4 First order systems 

Many systems in physics, like Maxwell's equations, the Dirac equation, and certain formulation of 
Einstein's equations, are described by first order partial differential equations (PDEs). Even if the 
system is described by a higher order PDE, it can be reduced to a first order one by introducing 
new variables. Therefore, let us specialize the above results to a first order linear problem of the 

form 

n ft 

u t = P{d/dx)u = J2 A3 jn u + Bu ^ xeR n , t> 0, (18) 

where A 1 , . . . , A n , B are complex m x m matrices. We split P(ik) = Pa(ik) + B into its principal 

n 

symbol, Po(ik) = i J] kjA^, and the lower order term B. The principal part is the one that 

3=1 

dominates for large \k\ and hence the one that turns out to be important for well-posedness. 
Notice that Po{ik) depends linearly on k. With these observations in mind we note: 

• A necessary condition for the problem to be well posed is that for each k £ R™ with \k\ — 1 
the symbol Po(ik) is diagonalizable and has only purely imaginary eigenvalues. To see this, 
we require the inequality 

| e |fc|Po(ifc')t+Bt| < Xe «* ? k':=^r, 

\k\ 
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for all t > and k £ M. n , k ^ 0, replace t by t/\k\, and take the limit |fc| — > oo, which yields 
| e P (2fc )t| < ^ f or a jj ^' g jgn w n\i | /-'| = l. Therefore, there must exist for each such k' 
a complex m X m matrix S(k') such that S(k')~ 1 Po(ik')S(k') — iA(k'), where A(fc') is a 
diagonal real matrix (cf. Lemma [I]). 

• In this case the family of Hermitian m x m matrices H(k') := (S(k')~ 1 )*S(k')~ 1 satisfies 

H(k')P (ik') + P {ik')*H(k') = 
for all k' € R" with |jfe'| = 1. 

• However, in order to obtain the energy estimate, one also needs the condition M~ l I < 
H(k') < MI, that is, H(k') must be uniformly bounded and positive. This follows automat- 
ically if H(k') depends continuously on k' , since k' varies over the (n — l)-dimcnsional unit 
sphere which is compactP] In turn, it follows that H(k') depends continuously on k' if S(k') 
does. However, although this may hold in many situations, continuous dependence of S(k') 



on k' cannot always be established, see Example 12 below for a counterexample 



These observations motivate the following three notions of hyperbolicity, each of them being a 
stronger condition than the previous one: 



Definition 2. The first order system (18) is called 



(i) weakly hyperbolic if all the eigenvalues of its principal symbol Po(ik) are purely imaginary. 

(ii) strongly hyperbolic if there exists a constant M > and a family of Hermitian m x m 
matrices H(k), k € S n , satisfying 

Ar l I < H{k) < MI, H(k)P {ik) + P (ik)*H{k) = 0, (19) 

for all k €E S"^ 1 , where S" 1 ^ 1 := {k £ K™ : \k\ — 1} denotes the unit sphere. 

(Hi) symmetric hyperbolic if there exists a Hermitian, positive definite mx m matrix H (which 
is independent of k) such that 

HP (ik) + P {ik)*H = 0, 

for alike S 71 - 1 . 

The matrix theorem implies the following statements: 

• Strongly and symmetric hyperbolic systems give rise to a well posed Cauchy problem. Ac- 
cording to Theorem [T] their principal symbol satisfies 

|e F <>W|<if, fceM™, tel, 

and this property is stable with respect to lower order perturbations, 

| e P(«)t| = | e P (zfc)t+B t | < Ke K\B\ t> k £ RB) t £ ^ 



The last inequality can be proven by applying Duhamel's formula ( 15 ) to the function 
u(t) := e p ( lfc )'/ which satisfies u t (t) = P (ik)u(t) + F(t) with F(t) = Bu{t). The solu- 
tion formula (15) then gives \u(t)\ < K(\f\ + \B\ f* \u(s)\ds) which yields \u(t)\ < Ke K ^ B ^\f\ 



upon integration. 



3 In fact, we will see in Section |3.2| that in the variable coefficient case, smoothness of the symmetrizer in k' is 
required. 
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• As we have anticipated above, a necessary condition for well-posedness is the existence of 
a complex m x m matrix S(k) for each k e S 1 " -1 on the unit sphere which brings the 
principal symbol Po(ik) into diagonal, purely imaginary form. If, furthermore, S(k) can 
be chosen such that |<S(fc)| and ^(/c) -1 ! are uniformly bounded for all k £ 5™ _1 , then 



H{k) := (S(k) 1 )*S(k) 1 satisfies the conditions (19) for strong hyperbolicity. If the sys- 



tem is well posed, Theorem 2.4.1 in Ref. 253 shows that it is always possible to construct 



a symmetrizer H(k) satisfying the conditions (19) in this manner, and hence, strong hy- 
perbolicity is also a necessary condition for well posedness. The symmetrizer construction 
H(k) := (S(k)~ 1 )*S(k)~ 1 is useful for applications, since S(k) is easily constructed from the 



eigenvectors and S(k) l from the eigenfields of the principal symbol, see Example 15 below 



• Weakly hyperbolic systems are not well posed in general because |e p °( lfc '*| might exhibit poly- 
nomial growth in \k\t. Although one might consider such polynomial growth as acceptable, 
such systems are unstable with respect to lower order perturbations. As the next example 
shows, it is possible that |e p ( lfc )*| grows exponentially in \k\ if the system is weakly hyperbolic. 

Example 10. Consider the weakly hyperbolic system [253] 

ut=( J M«x + of "! + ] )u, (20) 



o i ; x v -i -i , 

with aela parameter. The principal symbol is P§(ik) = ik ( I and 



1 



p p a (ik)t _ ikt I 1 ikt 
1 1 



Using the tools described in Section [2] we find for the norm 



| e P (ife)ti _ 




which is approximately equal to \k\t for large \k\t. Hence, the solutions to Equation (20) contain 
modes which grow linearly in \k\t for large \k\t when a = 0, i.e., when there are no lower order 
terms. 

However, when a^O, the eigenvalues of P{ik) are 

A± = ik — a ± i\/a(a + ik), 

which, for large k has real part Re(A±) « ±-\/|a||fc|/2. The eigenvalue with positive real part gives 
rise to solutions which, for fixed t, grow exponentially in |fc|. 

Example 11. For the system [343] . 

u t =A 1 u x +A 2 u v , A 1 = [ . _ ) , A 2 = 



2/' V 2 



the principal symbol, Po(ik) — i[ „ „/, , . \ ) , is diagonalizable for all vectors k — 

(fci, fe) € "S* 1 except for those with ki + k2 = 0. In particular, Po(ik) is diagonalizable for k = (1, 0) 
and k = (0,1). This shows that in general, it is not sufficient to check that the n matrices A 1 , 
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A 2 ,..., A" alone are diagonalizable and have real eigenvalues, one has to consider all possible 

n 

linear combinations ^ A^kj with k E ,S n_1 . 
3=1 

Example 12. Next, we present a system for which the eigenvectors of the principal symbol 
cannot be chosen to be continuous functions of k: 



u t = A 1 u x + A 2 u y + A 3 u z , A 1 = ( Q _ 1 

fci k 2 



A 2 



1 

1 



A 6 








fc-2 -k 



The principal symbol Po(ik) = i 

(fci,&2) 7^ (0,0) the corresponding eigenprojectors are 

1 ( X±(k) +iki 



) has eigenvalues \±(k) = ±i\fk 2 + k 2 and for 



P±{kiM) 



2A±(fc) 



/ h. 



ik 2 
A±(fc) — iki 



When (^1,^2) ~~ ► (0,0) the two eigenvalues fall together, and A(k) converges to the zero matrix. 
However, it is not possible to continuously extend P±(ki,k2) to (fei^fe) = (0,0). For example, 



P+(h,0) = 



1 




P+(-h,0) = 




1 



for positive h > 0. Therefore, any choice for the matrix S(k) which diagonalizes A{k) must be 
discontinuous at k = (0, 0, ±1) since the columns of S(k) are the eigenvectors of A(k). 

Of course, A(k) is symmetric and so S(k) can be chosen to be unitary which yields the trivial 
symmctrizer H(k) = I. Therefore, the system is symmetric hyperbolic and yields a well posed 
Cauchy problem; however, this example shows that it is not always possible to choose S(k) as a 
continuous function of k. 



Example 13. Consider the Klein-Gordon equation 

$ M = A$ - m 2 $, 



(21) 



in two spatial dimensions, where m £ R is a parameter which is proportional to the mass of the 
field $. Introducing the variables u = (<&, $t, & x , $ y ) we obtain the first order system 



Ui 



/0000\ /0000\ / 

10 1 

10 Ux+ Uy + 

yoooo/ \ 1 / V 






1 





\ 


2 

m 
































0/ 



The matrix coefficients in front of u x and u y are symmetric; hence the system is symmetric hyper- 
bolic with trivial symmetrizer H — diag(m 2 , 1, 1, l)r\ The corresponding Cauchy problem is well 
posed. However, a problem with this first order system is that it is only equivalent to the original, 
second order equation (21) if the constraints (ui) x = M3 and {u\) y = 114 are satisfied. 



An alternative symmetric hyperbolic first order reduction of the Klein-Gordon equation which 
does not require the introduction of constraints is the Dirac equation in two spatial dimensions, 



vt 



1 

1 



1 
-1 



Vl 

l"2 



(22) 



4 Here, the factor m? could in principle be replaced by any positive number, which shows that the symmetrizer is 
not always unique. The choice here is such that the expression u* Hu is proportional to the physical energy density 
of the system. Notice, however, that for the massless case m = one must replace m 2 = by a positive constant 
in order for the symmetrizer to be positive definite. 
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This system implies the Klein-Gordon equation (21 1 for either of the two components of v 



Yet another way of reducing second order equations to first order ones without introducing 
constraints will be discussed in Section f3. 1.51 below. 

Example 14. In terms of the electric and magnetic fields u — (E,B), Maxwell's evolution 
equations, 

Et = +VAB-J, (23) 

B t = -VAE, (24) 

constitute a symmetric hyperbolic system. Here, J is the current density and V and A denote the 
nabla operator and the vector product, respectively. The principal symbol is 

P °^V )= l { -kAE 
and a symmetrizer is given by the physical energy density, 

u*Hu=\{\E\ 2 + \B\ 2 ), 

in other words, H = 2~ 1 I is trivial. The constraints V • E = p and V • B = propagate as a 
consequence of Equations (23 24), provided that the continuity equation holds: (V • E — p) t — 



-V • J - p t = 0, (V ■ B) t = 0. 

Example 15. There are many alternative ways for writing Maxwell's equations. The following 
system |343|, I279J was originally motivated by an analogy with certain parametrized first order 
hyperbolic formulations of the Einstein equations, and provides an example of a system that can 
be symmetric, strongly, weakly or not hyperbolic at all, depending on the parameter values. Using 
the Einstein summation convention, the evolution system in vacuum has the form 

d t Ei = d j (W ij -W ji )-a{d i W j j -d j W ij ), (25) 

dtWij = -diEj - pi^Ek, (26) 



where Ei and Wij — diAj, i = 1,2, 3, represent the Cartesian components of the electric field and 
the gradient of the magnetic potential Aj , respectively, and where the real parameters a and /3 
determine the dynamics off the constraint hypersurface defined by d k Ek — and d^W^ — diWkj = 
0. 



In order to analyze under which conditions on a and j3 the system (25 26 ) is strongly hyperbolic 
we consider the corresponding symbol, 

(1 + a)k>Wij - tfWn - ahW 3 , \ _ ( Ei \ ,_ c2 



Decomposing E^ and Wij into components parallel and orthogonal to fcj, 

Ei = Eki + Ei, Wij = Wkikj + W.kj + kiVj + Wij + ^jU, 

where in terms of the projector ^ := 8^ ~ kiW orthogonal to k we have have defined E := k l Ei, 
E t ■- jSEj and W := kVWij, W t := Ji k W kj W, V 3 := k i W ik 'y k j , U := ^W tj , and W %J := 
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(7i fc 7j' 



2 1 7ii7 fc ')^ / fc/n we can wr ^e the eigenvalue problem P (ik)u = iXu as 



XE = 


= -alJ, 


XU = 


= ~PE, 


XW = 


■ "M)* 


XEi = 


\ / 

= (l + a)Wi-V 


xv t = 


= — -Si) 


XWi = 


= o, 


xw tj = 


= 0. 



It follows that Po(ik) is diagonalizable with purely complex eigenvalues if and only if aj3 > 0. 
However, in order to show that in this case the system is strongly hyperbolic one still needs to 
construct a bounded symmetrizer H(k). For this, we set /i := y/a/3 and diagonalize Po(ik) = 
i5'(fc)A(fc)5(fc)- 1 with A(fc) =diag(/i,-/i, 0,1, -1,0,0) and 



/ 



S(k)- 1 u = 



E-IU 
E+^U 



\ 



pw-(i + £\u 

%-Vi + (l + a)Wi 
Ei + Vi-_(1 + a)Wi 

m 
w i4 



V 

Then, the quadratic form associated with the symmetrizer is 
u*H{k)u = u^Siky^Sik)- 1 !! 



(27) 



= 2\E[ 



2~ p \U\ 2 



pw- 1 



u 



2E l E, 



+ 2 [V 1 - (1 + a)W l ] [Vi - (1 + a)W t ] + W l W t + W lJ W tj , 

and H (k) is smooth in k e S* 2 . Therefore, the system is indeed strongly hyperbolic for a/3 > 0. 

In order to analyze under which conditions the system is symmetric hyperbolic we notice that 
because of rotational and parity invariance the most general fc-independent symmetrizer must have 
the form 

u*Hu = a(E l )*Ei + b{W [i3] )*W M + c(W ij )*W l3 + dW*W, 

with strictly positive constants a, 6, c and d, where Wij :— W(y) — 5ijW/3 denotes the symmetric, 
trace-free part of Wij and W := W 3 j its trace. Then, 



u* H Po(ik)u 



ia(E l ) 



{a + 2)k j W [ij] + ak j W i 



+ ib(W [ij] )* E t k 3 - ic(W ij )*Eikj 



2a. 
~3 

id I 1 



k t W 
3/3 



W*k l E l 



For H to be a symmetrizer, the expression on the right-hand side must be purely imaginary. This 
is the case if and only if a(a + 2) = b, —aa = c and 2aa/3 = d(l + 3/3/2). Since a, b, c and d are 



5 Notice that Ei has only two degrees of freedom since it is orthogonal to k. Likewise, the quantities Wi, Vi have 
two degrees of freedom and Wij has three since it its orthogonal to k and trace-free. 
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positive, these equalities can be satisfied if and only if —2 < a < and f3 < —2/3. Therefore, if 
either a and f3 are both positive or a and /3 are both negative and a < —2 or (3 > —2/3, then the 



system (25 26) is strongly but not symmetric hyperbolic. 



3.1.5 Second order systems 

Another class of important systems in physics are wave problems. In the linear, constant coefficient 
case, they are described by an equation of the form 



f) 2 n F) Pi 

,xidx k ^ dx3 ^ 
3,k=l 3=1 i=i 



v *=Y, AJk w^ v+ ^ 2BJ w Vt+ ^ CJ w v+Dvt+Ev > " eE "' *- ' (28) 



where v = v(t,x) £ C m is the state vector, and A 13 — A°' i ,B : > ,C 3 ,D,E denote complex mx m 
matrices. In order to apply the theory described so far, we reduce this equation to a system which 

n 

is first order in time. This is achieved by introducing the new variable w := Vt — ^2 B J - 

3=1 

With this redefinition one obtains a system of the form (13]) with u = (v, w) T and 

P(d/d X ) = Y,B'g-+[ f :(A 3 k + B 3 Bk) _£_ + f :{C 3 +D . B 3 )i a j + E D 

3 = 1 \ j,k=l 3=1 

Now we could apply the matrix theorem, Theorem [II to the corresponding symbol P(ik) and 
analyze under which conditions on the matrix coefficients A % i , B^ , & ,D,E the Cauchy problem is 
well posed. However, since our problem originates from a second order equation, it is convenient to 
rewrite the symbol in a slightly different way: instead of taking the Fourier transform of v and w 
directly, we multiply v by |fc| and write the symbol in terms of the variable U := (\k\v, w) T . Then, 
the L 2 -norm of U controls, through Parseval's identity, the L 2 -norms of the first partial derivatives 
of v, as is the case for the usual energies for second order systems. In terms of U the system reads 



U t = Q(ik)U, t>0, ke W\ (29) 



in Fourier space, where 



_ 

Q(ik) = i\k\ £ b% + I _| fe | g {A]k + B 3 B *y k .% k + i g {C 3 + DB j)k 




3=1 \ j,k=l j=l 

with kj := kj/\k\. As for first order systems, we can split Q{ik) into its principal part, 

/ I \ 

Q Q {ik):=i\k\Y J B r k 3 + \k\ I j, (A? k + BW k )k 3 k k ' ( 31 ) 

3 = 1 V 3,k=l I 

which dominates for |fc| — ¥ oo, and the remaining, lower order terms. Because of the homogeneity 
of Qo(ik) in k we can restrict ourselves to values of k € S n ~ 1 on the unit sphere, like for first order 
systems. Then, it follows as a consequence of the matrix theorem that the problem is well posed 
if and only if there exists a symmetrizer H (k) and a constant M > satisfying 

Kr 1 ! < H(k) < MI, H(k)Q (ik) + Q {ik)*H(k) = 



5 Here, the advection term Y] _B J ifr« is subtracted from vt for convenience only. 

1=1 
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for all such k. Necessary and sufficient conditions under which such a symmetrizer exists have 
been given in Ref. |254j for the particular case in which the mixed second order derivative term in 



Equation (28 1 vanishes; that is, when B 3 — 0. This result can be generalized in a straightforward 



manner to the case where the matrices B 3 = j3 3 1 are proportional to the identity: 

Theorem 2. Suppose B 3 = f3' J I , j = 1, 2, . . . , n. (Note that this condition is trivially satisfied if 



m = I.) Then, the Cauchy problem for the equation (28) is well posed if and only if the symbol 



R(k) ■- J2 ( Aij + B l B 3 )k l k , k G S n ~\ 

has the following properties: there exist constants M > and 5 > and a family h{k) of Hermitian 
m x m matrices such that 

M' 1 ! < h(k) < MI, h(k)R(k) = R(k)*h(k) > 51 (32) 

for alike S 71 ' 1 . 

n A 

Proof. Since for B 3 = j3 3 1 the advection term i\k\ ^ B 3 kj commutes with any Hermitian matrix 

3=1 
H{k) 1 it is sufficient to prove the theorem for B 3 = 0, in which case the principal symbol reduces 

to 

11 I \ . _ nn-l 



Qom-=[ _ R(k) J. kes 

We write the symmetrizer H (fc) in the following block form, 

_ / H n (k) H 12 (k) 
( >~\ H 12 {kf H 22 (k) 

where Hn{k), H 22 {k) and H\ 2 {k) are complex m x m matrices, the first two being Hermitian. 
Then, 

R(k\n <ih\ + n UkY H(k\ - ( -Hi2{k)R{k) - R(k)*H 12 (kr H n (k) - R(k)*H 22 (k) 
H{k)Q (ik) + Q (ik) H(k)- I Hll (k) - H 22 (k)R(k) H 12 (k) + H 12 (k)* 



Now, suppose h(k) satisfies the conditions ( |32| ). Then, choosing H± 2 (k) := 0, H 22 (k) := h(k) and 
H n (k) := h(k)R(k) we find that H(k)Q {ik) + Q Q (ik)*H{k) = 0. Furthermore, M~ X I < H 22 {k) < 
MI and SI < H n (k) = h{k)R{k) < MCI where 

C := sup{|i?(fc)u| : k G 5 n_1 , u G C m , |u| = 1} 

is finite because R(k)u is continuous in k and u. Therefore, H{k) is a symmetrizer for Qo(ik), and 
the problem is well posed. 

Conversely, suppose that the problem is well posed with symmetrizer H(k). Then, the vanish- 
ing of H(k)Q (ik) + Q {ik)*H(k) yields the conditions H u (k) = H 22 (k)R(k) = R(k)*H 22 (k) and 
the conditions ([32]) are satisfied for h(k) :— H 22 (k). □ 



Remark: The conditions (32) imply that R(k) is symmetric and positive with respect to the 
scalar product defined by h(k). Hence it is diagonalizable, and all its eigenvalues are positive. A 
practical way of finding h(k) is to construct T(k) which diagonalizes R(k), T(k)^ 1 R(k)T(k) = P(k) 
with P(k) diagonal and positive. Then, h(k) := (T(fe) _1 )*T(fe) _1 is the candidate for satisfying 



the conditions ( 32 ) 
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Let us give some examples and applications: 

Example 16. The Klein-Gordon equation vu — At> — m 2 v on flat spacetime. In this case, 
j\ij — gij anc [ gj — Q ; anc [ R(k) — \k\ 2 trivially satisfies the conditions of Theorem ^1 

Example 17. In anticipation of the following subsection, where linear problems with variable 
coefficients are treated, let us generalize the previous example on a curved spacetime (M,g). We 
assume that (M, g) is globally hyperbolic such that it can be foliated by space-like hypersurfaces 
Yi t . In the ADM decomposition, the metric in adapted coordinates assumes the form 

g = -a 2 dt <gidt + -f ij (dx i + ftdt) ® (dx j + (3 j dt), 

with a > the lapse, (3 l the shift vector which is tangent to S t and jijdx 1 <S> dx^ the induced 
three- metric on the spacelike hypersurfaces T. t . The inverse of the metric is given by 



o 



i ( d al d \ (d oi d \ j, d d 



where 7 lJ are the components of the inverse three-metric. The Klein-Gordon equation on (M, g) 



is 

.9 



V- dct (a) v 



which, in the constant coefficient case, has the form of Equation ( 28 ) with 

Hence, R(k) — o?*fikikj, and the conditions of Theorem [2] are satisfied with h(k) = 1 since a > 
and j lJ is symmetric positive definite. 

3.2 Linear problems with variable coefficients 

Next, we generalize the theory to linear evolution problems with variable coefficients. That is, we 
consider equations of the following form: 

ut =P(t,x,d/dx)u= ^2 A v (t,x)D„u, lei", t > 0, (33) 

\"\<p 

where now the complex m x m matrices A v (t, x) may depend on t and x. For simplicity, we assume 
that each matrix coefficient of A v belongs to the class C£°([0, oo) x K") of bounded, C°°-functions 
with bounded derivatives. Unlike the constant coefficient case, the different fc-modes couple when 
performing a Fourier transformation, and there is no simple explicit representation of the solutions 
through the exponential of the symbol. Therefore, Definition [T] of well posedness needs to be 
altered. Instead of giving an operator-based definition, let us define well posedness by the basic 
requirements a Cauchy problem should satisfy: 

Definition 3. The Cauchy 'problem 

u t {t, x) = P(t, x, d/dx)u(t, x), xe R", t > 0, (34) 

u(0,x) = f(x), xeR n , (35) 

is well posed, if any f e C^°(M. n ) gives rise to a unique C°° -solution u(t,x), and if there are 
constants K > 1 and a £ K such that 

\\u(t,-)\\<Ke at \\f\\ (36) 

for all f e Cg°(R") and allt>0. 



22 



Before we proceed and analyze under which conditions on the operator P(t, x, d/dx) the Cauchy 



problem (34 35 1 is well posed, let us make the following observations: 



In the constant coefficient case, the inequality (36) is equivalent to the inequality ([9]), and in 
this sense Definition [3] is a generalization of Definition [l] 



• If u\ and u 2 are the solutions corresponding to the initial data fi, f% G Cq > (. 
difference u — u 2 — u\ satisfies the Cauchy problem (34 35) with / = fy 



estimate ( 36 ) implies that 

|Mt,-)-Wi(V)ll<i^ Qt ||/2-/i||, 



t>0. 



a ), then the 
/i and the 

(37) 



In particular, this implies that u 2 (t,-) converges to U\{t, •) if f% converges to f± in the L 2 - 
sense. In this sense, the solution depends continuously on the initial data. This property is 
important for the convergence of a numerical approximation, as discussed in Section [7] 



• The estimate (36) also implies uniqueness of the solution, because for two solutions u\ and 

l ) the inequality (37) implies u\ 



u 2 with the same initial data f\ = f 2 G C, 



o°°( 



u 2 . 



• As in the constant coefficient case, it is possible to extend the solution concept to weak 
ones by taking sequences of C°°-elements. This defines a propagator U(t,s) : L 2 (R n ) —> 
L 2 (R n ) which maps the solution at time s > to the solution at time t > s and satisfies 
similar properties than the ones described in Section 3.1.2 (i) U(t,t) — I for all t > 0, (ii) 
U(t,s)U( Sl r) = U(t,r) for all t > s > r > 0, (hi) for / E C^{W l ), U(t,0)f is the unique 
solution of the Cauchy problem (J34j [35}, (iv) \\U{t,s)f\\ < lfe a (*- s )||/|| for all / € L 2 {R) 
and all t > s > 0. Furthermore, the Duhamel formula |l~5"l ) holds with the replacement 
U(t-s) h->[7(t,s). 



3.2.1 The localization principle 

Like in the constant coefficient case, we would like to have a criterion for well posedness that is 
based on the coefficients A v (t, x) of the differential operator alone. As we have seen in the constant 
coefficient case, well posedness is essentially a statement about high frequencies. Therefore, we are 
led to consider solutions with very high frequency or, equivalently, with very short wavelength. In 
this regime we can consider small neighborhoods and since the coefficients A u (t, x) are smooth, 
they are approximately constant in such neighborhoods. Therefore, intuitively, the question of 
well posedness for the variable coefficient problem can be reduced to a frozen coefficient problem, 
where the values of the matrix coefficients A u (t, x) are frozen to their values at a given point. 

In order to analyze this more carefully, and for the sake of illustration, let us consider a first 
order linear system with variable coefficients 



Ut = P(t, x, d/dx)u 



d 

z ^A ] (t,x)g—u + B(t,x)u, 



E 

3 = 1 



X G 



t>0, 



(38) 



where A 1 , . . . , A n , B are complex mxm matrices whose coefficients belong to the class C£° ( [0, oo) x 
M. n ) of bounded, C°°-functions with bounded derivatives. As mentioned above, the Fourier trans- 
form of this operator does not yield a simple, algebraic symbol like in the constant coefficient casejj 

7 The Fourier transform of the function v(t, •) = P(t, ■, d/dx)u(t, •) in space is formally given by 

1 



0(t,.) : 



(2tt)"/ 2 



^2 A 3 (t, ■) * ikju(t, ■) + B(t, ■) * u(t, ■) 

3=1 



where A J (t, ■), B(t, •) denote the Fourier transform of A J (i, ■) and B(t, •), respectively, and where the star denotes 
the convolution operator. Unless A> and B are independent of x, the different fc-modes couple to each other. 
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However, given a specific point p = (t , x ) £ [0, oo) x M n , we may zoom into a very small neigh- 
borhood of pq. Since the coefficients A>(t,x) and B(t,x) are smooth, they will be approximately 
constant in this neighborhood and we may freeze the coefficients of A?{t,x) and B(t,x) to their 
values at the point pq. More precisely, let u(t, x) be a smooth solution of the equation ( [38] ) . Then, 
we consider the formal expansion 

u{t +et,x +ex)=u{t 0l x )+eu {1) {t,x)+e 2 u {2) {t 1 x) + ..., e > 0. (39) 



.(%+ ^j.«.( 2 ) 



38 ) one obtains 








n 


[duP>, s du (2 \ , 1 
„ . (t, x) + e - . (t, x) + ... 
dxi K ' dxi ' 


+ B(t + et,x + ex) 


u(t , x a ) + ev> l > (t,x) + ... 





(t,x) + euf>(t,x) + 



Taking the pointwise limit e — > on both sides of this equation we obtain 

u^^x) = ^A j {to,x )^-{t,x) + F = P (t ,x Q ,d/dx)u W (t,x) + F , 

where Fo := B(t , x )u(to,XQ). Therefore, if u is a solution of the variable coefficient equa- 
tion Ut = P(t,x,d/dx)u, then, ir 1 ' satisfies the linear constant coefficient problem u\ (t.x) = 
Po(to, xq, d/dx)vf- l > +Fq obtained by freezing the coefficients in the principal part of P(t, x, d/dx)u 
to their values at the point pq and by replacing the lower order term B(t,x)u by the forcing term 
Fq. By adjusting the scaling of t, a similar conclusion can be obtained when P(t,x,d/dx) is a 
higher-derivative operator. 

This leads us to the following statement: a necessary condition for the linear, variable coefficient 
Cauchy problem for the equation u t = P(t, x, d /dx)u to be well posed is that all the corresponding 
problems for the frozen coefficient equations v t = Po(to,xo,d/dx)v are well posed. For a rigorous 
proof of this statement for the case in which P(t, x, d/dx) is time-independent, see Ref. [385J. We 
stress that it is important to replace P(t, X, d/dx) by its principal part Po(t, x, d/dx) when freezing 
the coefficients. The statement is false if lower order terms are retained, see Refs. 253, 385 for 
counterexamples. 

Now it is natural to ask whether or not the converse statement is true: suppose that the Cauchy 
problems for all frozen coefficient equations vt = Po(to,Xo,d/dx)v are well posed; is the original, 
variable coefficient problem also well posed? It turns out this localization principle is valid in 
many cases under additional smoothness requirements. In order to formulate the latter, let us go 



back to the first order equation ( 38 ) . We define its principal symbol as 



Po(t,x,ik):=iy^A j (t,x)kj. (40) 






In analogy to the constant coefficient case we define: 



Definition 4. The first order system (38) is called 

(i) weakly hyperbolic if all the eigenvalues of its principal symbol Po(t,x,ik) are purely imag- 
inary. 

(ii) strongly hyperbolic if there exist M > and a family of positive definite, Hermitian m x m 
matrices H(t, x, k), (t, x,k) £ fix S n ~ l , whose coefficients belong to the class C^°(Q x S" 1 ^ 1 ), 
such that 

Ar 1 ! < H(t,x,k) < MI, H(t,x,k)P Q (t,x,ik) + P a (t,x,ik)*H(t,x,k) = 0, (41) 
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for all (t,x,k) eflx S n ~ l , where ft := [0, oo) X R n . 

(Hi) symmetric hyperbolic if it is strongly hyperbolic and the symmetrizer H{t,x,k) can be 
chosen independent of k. 

We see that these definitions are straight extrapolations of the corresponding definitions (see 
Definition pi) in the constant coefficient case, except for the smoothness requirements for the sym- 
metrizer H(t, x,k)P\ There are examples of ill posed Cauchy problems for which a Hermitian, 
positive definite symmetrizer H(t, x, k) exists but is not smooth, see Ref. [385], showing that these 
requirements are necessary in general. 

The smooth symmetrizer is used in order to construct a pseudo-differential operator 

[H(t)v](x) := j^j- 2 J H(t,x,k/\k\y k -*v(k)d n k, v(k) = ^-^ J e -^v(x)d n x, 
from which one defines a scalar product (•, -)ff(t) which, for each t is equivalent to the L 2 product. 



This scalar product has the property that a solution u to the equation ( 38 ) satisfies an inequality 
of the form 

-j t (u,u) H{t} <b(T)(u,u) H (t), 0<t<T, 



see, for instance, Ref. 399 . Upon integration this yields an estimate of the form of Equation (36 1. 
In the symmetric hyperbolic case, we have simply [H(t)v] = H(t,x)v(x) and the scalar product is 
given by 

(u, v) H{t) := [ u(x)*H(t, x)v(x)d n x, u, v e L 2 (R n ). 

We will return to the application of this scalar product for deriving energy estimates below. Let 
us state the important result: 



Theorem 3. If the first order system \3lty is strongly or symmetric hyperbolic in the sense of 
Definition^ then the Cauchy problem (34 \35) is well posed in the sense of Definitions 

For a proof of this theorem, see for instance Proposition 7.1 and the comments following its 
formulation in chapter 7 of Ref. [399]. Let us look at some examples: 



Example 18. For a given, stationary fluid field, the non-relativistic, ideal magnetohydrodynamic 
equations reduce to the simple system |117j 

B t = V A (v A B) (42) 

for the magnetic field B, where v is the fluid velocity. The principal symbol for this equation is 
given by 

P {x, ik)B = ikh (v(x) A B) = {ik ■ B)v{x) - (ik ■ v(x))B. 

In order to analyze it, it is convenient to introduce an orthonormal frame ei, e^i e 3 such that e\ is 
parallel to k. With respect to this, the matrix corresponding to Po(x,ik) is 



v 2 (x) —Vi(x) 

v 3 (x) -vi(x) 

with purely imaginary eigenvalues 0, —i\k\vi(x). However, the symbol is not diagonalizable when 
k is orthogonal to the fluid velocity, v\(x) = 0, and so the system is only weakly hyperbolic. 



'These smoothness requirements arc sometimes omitted in the numerical relativity literature. 
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One can still show that the system is well posed if one takes into account the constraint 
V • B = 0, which is preserved by the evolution equation (42). In Fourier space, this constraint 
forces B\ = 0, which eliminates the first row and column in the principal symbol, and yields a 
strongly hyperbolic symbol. However, at the numerical level, this means that special care needs 



to be taken when discretizing the system ( 42 ) since any discretization which does not preserve 
V • B = will push the solution away from the constraint manifold in which case the system 
is weakly hyperbolic. For numerical schemes which explicitly preserve (divergence-transport) or 
enforce (divergence-cleaning) the constraints, see Refs. [227] and |133| . respectively. For alternative 
formulations which arc strongly hyperbolic without imposing the constraint, see Ref. |117j . 



Example 19. The localization principle can be generalized to a certain class of second order 
systems [254 1 [300 : For example, we may consider a second order linear equation of the form 






d 2 



dx J dx k 



■j2 2BJ fr*) 



d 
dx~J 



vi - 



d_ 
'dxi 



-Y j a{t 1 x)^-v + D{t 1 x)v t +E(t 1 x)v 1 (43) 

3=1 



x G W\ t > 0, where now the m x m matrices A J , B 1 , C- 7 , D and E belong to the class 
C£°([0, oo) x K") of bounded, C°°-functions with bounded derivatives. Zooming into a very small 
neighborhood of a given point po = (t n ,xo) by applying the expansion in Equation (39) to v, one 
obtains, in the limit e — > 0, the constant coefficient equation 



(2) 
Vtt 



n 8 2 vW " r)?; (2) 

(t,x) = ]T Alk ( t o^o)g-g^(t,x) + ^BHt ,x )^-(t,x)-\r l: . 

j,k=l ' ' ' '' i = l 



dxJ 



(44) 



with 



F := y^C j (to,x )-—j(to,Xo) + D(to,xo)v t (to,Xo) + E(t ,xo)v(to,x ), 



3 = 1 



dxi 



where we have used the fact that v^(t,x) — tvt(to,xo) + X) ^g^H^cb^o)- Equation (44) can be 

]=} 
rewritten as a first order system in Fourier space for the variable 



\k\v 



U = 



v t -iJ2 Bi(t ,x )kjV 
i=i 



see Section 3.1.5 Now Theorem [2] implies that the problem is well posed if there exist constants 
M > and 5 > and a family of positive definite m x m Hcrmitian matrices h(t, x, k), (t, x, k) <G 
ft x S n ~ l , which is C°°-smooth in all its arguments such that M~ l I < h(t,x,k) < MI and 
h(t,x,k)R(t,x,k) = R(t,x,k)*h(t,x,k) > 51 for all (t,x,k) e ft x 5 n_1 , where R(t,x,k) := 

n 

J2 (AV (t,x) + B l (t 1 x)Bi(t,x))k l k 3 . 

In particular, it follows that the Cauchy problem for the Klein-Gordon equation on a globally 
hyperbolic spacetime M = [0,oo) x W 1 with a,/3*,7ij € C^°([0, oo) x M. n ), is well posed provided 
that a 2 ^ is uniformly positive definite, see Example 17 



3.2.2 Characteristic speeds and fields 



Consider a first order linear system of the form (38) which is strongly hyperbolic. Then, for 
each t > 0, x e R™ and k £ 5 n_1 the principal symbol Po(t,x,ik) is diagonalizable and has 



2G 



purely complex eigenvalues. In the constant coefficient case with no lower order terms (B = 0) an 
eigenvalue ifx(k) of Po(ik) with corresponding eigenvector a(k) gives rise to the plane wave solution 

u{t, x) = a{k)e^ {k)t+lk - x , (>0,ieK n (45) 

If lower order terms arc present and the matrix coefficients A 3 (t, x) are not constant one can look 
for approximate plane wave solutions which have the form 

u(t,x)=a 6 (t,x)e iE ~ 1,p(t ^\ t>0,xeR n , (46) 

where e > is a small parameter, ip(t,x) a smooth phase function and a e {t,x) — ao(t,x)+eai(t,x) + 



e 2 a,2(t,x) + ... a slowly varying amplitude. Introducing the ansatz (46) into Equation (38) and 
taking the limit e — > yields the problem 

iip t a = Po(t,x,iVtp)a = i^A 3 (t,x)-—a . (47) 

Setting uj(t,x) := ipt{t,x) and k(t,x) := V - 0(i,x), a nontrivial solution exists if and only if the 
cikonal equation 

det[iuI-P (t,x,ik)] = (48) 

is satisfied. Its solutions provide the phase function tp(t,x) whose level sets have conormal uidt 



k ■ dx. The phase function and ao determine approximate plane wave solutions of the form (46). 
For this reason we call tu(k) the characteristic speed in the direction k £ 5 n_1 , and ao a 
corresponding characteristic mode. For a strongly hyperbolic system, the solution at each point 
(t,x) can be expanded in terms of the characteristic modes 6j(t,x,k) with respect to a given 
direction k e S n ~ l , 

m 

u(t, x) = 2_, u^ 3 '(t,x, k)ej(t,x, k). 

The corresponding coefficients u^\t,x,k) are called the characteristic fields. 

Example 20. Consider the Klein-Gordon equation on a hyperbolic spacetime, like in Example [T7| 
In this case the eikonal equation is 

= det [i.I - Q (ik)] = det ( _^^/] fc| z(cj %. } ) = -(" - W + ^hk 3 , 

which yields oj±(k) — ftkj ± a^J^kikj. The corresponding conormals uj±(k)dt + kjdx 3 is null; 
hence the surfaces of constant phase are null surfaces. The characteristic modes and fields are 

6±(fc) = ( Ta^hkj ) ' ui±){k) = 2 [W\ T a^m) ' 
where U = {U\, U^) — (\k\v : v t — i^kjv) and v is the Klein-Gordon field. 



Example 21. In the formulation of Maxwell's equations discussed in Example 15 the character- 



istic speeds are 0, ±y/a/3 and ±1, and the corresponding characteristic fields are the components 



of the vector on the right-hand side of Equation ( 27 1 
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3.2.3 Energy estimates and finite speed of propagation 

Here we focus our attention on first order linear systems which are symmetric hyperbolic. In this 
case it is not difficult to derive a priori energy estimates based on integration by parts. Such 
estimates assume the existence of a sufficiently smooth solution and bound an appropriate norm 
of the solution at some time t > in terms of the same norm of the solution at the initial time 
t = 0. As we will illustrate here, such estimates already yield quite a lot of information on the 
qualitative behavior of the solutions. In particular, they give uniqueness, continuous dependence 
on the initial data and finite speed of propagation. 

The word "energy" stems from the fact that for many problems the squared norm satisfying 
the estimate is directly or indirectly related to the physical energy of the system, although for 
many other problems the squared norm does not have a physical interpretation of any kind. 

For first order symmetric hyperbolic linear systems, an a priori energy estimate can be con- 
structed from the symmetrizer H (t, x) in the following way. For a given smooth solution u(t, x) of 



Equation (38 1, define the vector field J on il — [0, oo) x M™ by its components 

J»(t,x) :=-u(t,x)*H(t,x)A ll (t,x)u(t,x), /x = 0, 1,2, . . . ,n, (49) 

where A (i, x) := —I. By virtue of the evolution equation, J satisfies 

fe=i 



d^(t, x) = ^ J°(t, x) + Y, Qx^ jk ^ x ) = u & X Y K & x ) u & «)> ( 5 °) 



where the Hermitian m x m matrix K(t,x) is defined as 

- 8 
K(t, x) := H(t, x)B(t, x) + B(t, xfH(t, x) + H t (t, x)-^^ (#(*> ^^(^ x )) ■ 

fc=i 



If K = 0, Equation (50 1 formally looks like a conservation law for the current density J. If 
K ^ we obtain, instead of a conserved quantity, an energy-like expression whose growth can 
be controlled by its initial value. For this, we first notice that our assumptions on the matrices 
H(t,x), B(t,x) and A k (t,x) imply that K(t,x) is bounded on f2. In particular, since H(t,x) is 
uniformly positive, there is a constant a > such that 

K(t, x) < 2aH(t, x), (t, x) € ft. (51) 

Let fix — Uo<t<T ^* ^ e a tubular region obtained by piling up open subsets S t of t = const 
hypersurfaces. This region is enclosed by the initial surface £q> the final surface T,t and the 



boundary surface T := Uo<t<T ®^t wn ich is assumed to be smooth. Integrating Equation (50) 
over £It and using Gauss' theorem, one obtains 

J°(t, x)d n x = f J°(t, x)d"x - { e^(t, x)dS + f u{t, x)*K(t, x)u{t, x)dtd n x, (52) 

2-i 'j' 2-i q / it 'j' 

where e^ is the unit outward normal covector to T and dS the volume element on that surface. 
Defining the "energy" contained in the surface E t by 

E(S t ):= f J°(t,x)d n x= J u{t,x)*H(t,x)u(t,x)d n x (53) 
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and assuming for the moment that the "flux" integral over T is positive or zero one obtains the 
estimate 

E(E T ) < E(T, Q )+ f f u(t,x)*K{t 1 x)u(t,x)d n x\dt 

T 

< E(S )+2a I E(£ t )dt, (54) 



where we have used the inequality (51 1 and the definition of E(Ht) hi the last step. Defining the 



(h(t)e- 2at ) < £(£ )e~ 2Qt , < t < T, 



function h(T) := L E(Ti t )dt this inequality can be rewritten as 

d 
dt 

which yields ah(T) < i?(£n)(e 2aT — 1) upon integration. This together with (54 1 gives 

£(E t ) < e 2at E(Z ), 0<t<T, (55) 

which bounds the energy at any time t € [0, T] in terms of the initial energy. 

In order to analyze the conditions under which the flux integral is positive or zero, we examine 
the sign of the integrand e M J M (i,x). Decomposing e^dx^ — N[adt + s\dx l + . . . + s 2 dx n ] where 
s = (si, . . . , s„) is a unit vector and N > a positive normalization constant, we have 

e p J M (i, x) = N(t, x) u(t, x)*[a(t, x)H(t, x) ~ H(t, x)P (t, x, s)]u(t, x), 

n 

where Po(t, x, s) = J^ A 3 (t, x)sj is the principal symbol in the direction of the unit vector s. This 

3=1 

is guaranteed to be positive if the boundary surface T is such that a(t, x) is greater than or equal 
to all the eigenvalues of the boundary matrix Po(t,x,s), for each (t, x) E T. This is equivalent 
to the condition 

,, n. u*H(t,x)P (t,x,s)u 

a(t, x) > sup for all (t, x) G T. (56) 

ueC m ,«^o u*H(t,x)u 

Since H(t, x)Po(t, x, s) is symmetric, the supremum is equal to the maximum eigenvalue of Po{t, x, s). 



Therefore, condition (56 1 is equivalent to the requirement that a(t, x) be greater than or equal to 
the maximum characteristic speed in the direction of the unit outward normal s. 
With these arguments, we arrive at the following conclusions and remarks: 

• Finite speed of propagation. Let po = (to,xo) € £7 be a given event, and set 

, x f U*H(t,x)Pn(t,X,s)u „ _„ „_ 1 ^ m , „ 

v (to) := sup <^ v ', \ ' ; : < t < t , x € W 1 , s € S" 1-1 , u GC m ,u^0 

(_ u*H(t,x)u 

Define the past cone at po a^j 

C-(p ) := {(t, x)en:\x\< v(t )(t ~ t)}. (57) 

9 In principle, the maximum propagation speed v(to) could be infinite. Finiteness can be guaranteed, for instance, 
by requiring the principal symbol Po(t,x,s) to be independent of (t,x) for \x\ > R outside a large ball of radius 
R> 0. 
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The unit outward normal to its boundary is e^dx" — N[v(to)dt + x ■ dx/\x\\ which satisfies 
the condition (56 1. It follows from the estimate (55) applied to the domain VLt — C~(po) 



that the solution u is zero on C~(po) if the initial data is zero on the intersection of the 
cone C~ (po) with the initial surface t = 0. In other words, a perturbation in the initial data 
outside the ball \x\ < v(to)to does not alter the solution inside the cone C~(po). Using this 
argument, it also follows that if / has compact support, the corresponding solution u{t, •) 
also has compact support for all t > 0. 

• Continuous dependence on the initial data. Let / e C^°(IR n ) be smooth initial data 
with compact support. As we have seen above, the corresponding smooth solution u(t, ■) 



also has compact support for each t > 0. Therefore, applying the estimate (55 1 to the case 
T, t := {£} x Hi™, the boundary integral vanishes and we obtain 



£(£ t ) < e 2at E(X ), 



t>0. 



In view of the definition of E(T, t ), see Equation (53), and the properties (41) of the sym- 
metrizer it follows that 



|Ki,-)||<Me a *||/||, i> , 



(58) 



which is of the required form, see Definition [3J In particular, we have uniqueness and con- 
tinuous dependence on the initial data. 

• The statements about finite speed of propagation and continuous dependence on the data 
can easily be generalized to the case of a first order symmetric hyperbolic inhomogeneous 
equation Ut = P(t,x,d/dx)u + F(t,x), with F : fl — > C m a bounded, C°°-function with 



bounded derivatives. In this case, the inequality (58) is replaced by 



\u(t,-)\\<Me c 



l/ll 



\F(s,-)\\ds 



t>0. 



(59) 



If the boundary surface T does not satisfy the condition ( 56 1 for the boundary integral to be 



positive, then suitable boundary conditions need to be specified in order to control the sign 
of this term. This will be discussed in Section 15.21 



• Although different techniques have to be used to prove them, very similar results hold for 
strongly hyperbolic systems, see Ref. |343j . 



Example 22. We have seen that for the Klein-Gordon equation propagating on a globally 
hyperbolic spacetime, the characteristic speeds are the speed of light. Therefore, in the case of a 
constant metric (i.e., Minkowksi space), the past cone C~(po) defined in Equation (57) coincides 
with the past light cone at the event po- A slight refinement of the above argument shows that the 
statement remains true for a Klein-Gordon field propagating on any hyperbolic spacetime. 



Example 23. 

Example 



15 



In Example [2X1 we have seen that the characteristic speeds of the system given in 
are 0, ±\faf5 and ±1, where a/3 > is assumed for strong hyperbolicity. Therefore, 
the past cone C~(po) corresponds to the past light cone provided that < a/3 < 1. For a/3 > 1, the 
formulation has superluminal constraint-violating modes, and an initial perturbation emanating 
from a region outside the past light cone at po could affect the solution at po- I n this case, the 
past light cone at po is a proper subset of C~(po). 
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3.3 Quasilinear equations 

Next, we generalize the theory one more step and consider evolution systems which are described by 
quasilinear partial differential equations, that is, by nonlinear partial differential equations which 
are linear in their highest order derivatives. This already covers most of the interesting physical 
systems, including the Yang-Mills and the Einstein equations. Restricting ourselves to the first 
order case, such equations have the form 

n ft 

u t =^A ] (t,x,u)-—u + F(t,x,u), 0<t<T, xeR n , (60) 

i=i 

where all the coefficients of the complex mxm matrices A l (t, x, u), . . . , A n (t, x, u) and the nonlinear 
source term F(t,x,u) € C m belong to the class C fc °°([0,T] x R" x C m ) of bounded, C°°-functions 
with bounded derivatives. Compared to the linear case, there are two new features the solutions 
may exhibit: 

• The nonlinear term F(t, x, u) may induce blowup of the solutions in finite time. This is 

already the case for the simple example where to = 1, all the matrices A 3 vanish identically 



and F(t,x,u) = u 2 , in which case Equation (60) reduces to u t = u 



• In contrast to the linear case, the matrix functions A J in front of the derivative operator 
now depend pointwise on the state vector u itself, which implies, in particular, that the 
characteristic speeds and fields depend on u. This can lead to the formation of shocks where 
characteristics cross each other, like in the simple example of Burger's equation u t = uu x 
corresponding to the case to = n = 1, A (t, X,u) — u and F(t, x, u) = 0. 

For these reasons, one cannot expect global existence of smooth solutions from smooth initial data 
with compact support in general, and the best one can hope for is existence of a smooth solution 
on some finite time interval [0,T], where T might depend on the initial data. 

Under such restrictions, it is possible to prove well posedness of the Cauchy problem. The idea 
is to linearize the problem and to apply Banach's fixed point theorem. This is is discussed next. 

3.3.1 The principle of linearization 



Suppose v,(°'(t, x) is a C°° (reference) solution of Equation (60 1, corresponding to initial data 
f(x) = v,(°'(0,x). Assuming this solution to be uniquely determined by the initial data /, we may 
ask if a unique solution u also exists for the perturbed problem 

" d 

u t (t,x) = ^2A :1 (t,x,u)-—u(t,x) + F(t,x,u)+5F(t,x), xE« n , < t < T, (61) 

i=i 

u(0, x) = f(x) + Sf(x), x £ K n , (62) 

where the perturbations 6F(t,x) and Sf(x) belong to the class of bounded, C°°-functions with 
bounded derivatives. This leads to the following definition: 



Definition 5. Consider the nonlinear Cauchy problem given by Equation (60) and prescribed 
initial data for u at t = 0. Let u^ ' be a C°° -solution to this problem which is uniquely determined 
by its initial data f. Then, the problem is called well posed at w ', if there are normed vector 
spaces X , Y , and Z and constants K > 0, e > such that for all sufficiently smooth perturbations 
5f and SF lying in Y and Z , respectively, with 

\\Sf\\ Y + \\SF\\ z <e, 
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the perturbed problem (61, 62) is also uniquely solvable and the corresponding solution u satisfies 
u — u^> £ X and the estimate 



u, ■ 



,(°)| 



X 



<K(\\Sf\\ 



Y 



\SF\\ 



(63) 



Here, the norms X and Y appearing on both sides of Equation ( 63 ) are different from each 
other because \\u — u^\\x controls the function u — u^ over the spacetime region [0, T] x R™ while 
|| Sf || y is a norm controlling the function Sf on W 1 . 

If the problem is well posed at u*- ' we may consider a one-parameter curve f e of initial data 
lying in C§°(R n ) that goes through / and assume that there is a corresponding solution u 6 (t,x) 
for each small enough |e| which lies close to u^ ' in the sense of inequality (63). Expanding 



u e (t,x) = u {0) (t,x) + ev (1) (t,x) + e 2 v {2 \t,x) + 



and plugging into the Equation (60) we find, to first order in e 



v^ =j2Ai(t, X )£jv 



^+B (t,x)v^\ 



(64) 



with 



A J (t, x) = A>(t, x, «W(t, x)), B (t, x) = 



-Q^(t,x, u(°\t,x))— + —(t,x,vS°i(t,x)). 



Equation (64) is a first order linear equation with variable c oeffi cients for the first variation, 
i/ 1 ', for which we can apply the theory described in Section 3.2 Therefore, it is reasonable 
to assume that the linearized problem is strongly hyperbolic for any smooth function w°'(t,x). 
In particular, if we generalize the definitions of strongly and symmetric hyperbolicity given in 
Definition Pfl to the quasilinear case by requiring that the symmetrizer H(t,x,k,u) has coefficients 
in C£°(n x S*" -1 x C m ), it follows that the linearized problem is well posed provided that the 
quasilinear problem is strongly or symmetric hyperbolic. 

The linearization principle states that the converse is also true: the nonlinear problem is 
well posed at u^ if all the linear problems which are obtained by linearizing Equation (60) at 



functions in a suitable neighborhood of u^°' are well posed. To prove that this principle holds, one 
sets up the following iteration. We define the sequence u^ of functions by iteratively solving the 
linear problems 



(k+i) 



d 



= ^A :i {t,x 1 u {k) )-—u {k+1) +F{t,x 1 u (k) ) + 5F{t 1 x), x € W 1 , < t < T, (65) 



i=i 



S k+1 \0,x) = f(x) + 5f(x), xe 



(66) 



for k = 0,1,2,... starting with the reference solution u' '. If the linearized problems are well 
posed in the sense of Definition k3l for functions lying in a neighborhood of vS°\ one can solve each 
Cauchy problem (65 66), at least for small enough time 71-. The key point then, is to prove that 
Tfe does not shrink to zero when k — >■ oo and to show that the sequence u^ k > of functions converges 



to a solution of the perturbed problem (61 62 ) . This is, of course, a nontrivial task which requires 



controlling it" and its derivatives in an appropriate way. For particular examples where this 
program is carried through, see |253) . For general results on quasilinear symmetric hyperbolic 
systems, see [213 USB HOD]- 
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3.4 Abstract evolution operators 

A general framework for treating evolution problems is based on methods from functional analysis. 
Here, one considers a linear operator A : D(A) c X — > X with dense domain, D(A) = X, in a 
Banach space X and asks under which conditions the Cauchy problem 

u t {t)=Au{t), i>0, (67) 

«(0) - /, (68) 

possesses a unique solution curve, i.e., a continuously differentiable map u : [0, oo) — > D(A) C X 



satisfying Equations (67 681) for each / £ D(A). Under a mild assumption on A this turns out to 
be the case if and only if lu | the operator A is the infinitesimal generator of a strongly continuous 
semigroup P(t), that is, a map P : [0, oo) — > C(X), with C(X) denoting the space of bounded, 
linear operators on X, with the properties that 

(i) P(P) = I, 

(ii) P(t + s) = P{t)P(s) for all t, S > 0, 
(iii) lim P(i)u = u for all u £ X, 

y ' 4-vO 

(iv) D{A) = |u e X : lim | [P(£)u - u] exists in X j and Au = lim | [P(t)u -v],u£ D(A). 



In this case, the solution curve u of the Cauchy problem (67 68 1 is given by u(t) = P(t)f, t>0, 
f G D{A). One can show [318, 50 that P{t) always possesses constants K > 1 and a £ K such 
that 

\\P{t)\\ < Ke a \ i>0, (69) 

which implies that ||u(i)|| < i^e a '||/|| for all / £ D{A) and all t > 0. Therefore, the semigroup 
P(t) gives existence, uniqueness and continuous dependence on the initial data. 

There are several results giving necessary and sufficient conditions for the linear operator A 
to generate a strongly continuous semigroup, see, for instance, Refs. [3181 150] . One useful result, 
which we formulate for Hilbert spaces, is the following: 

Theorem 4 (Lumer-Phillips) . Let X be a Hilbert space with scalar product (•,•)> an d ^ -^ : 
D(A) C X — > X be a linear operator. Let a £ R. Then, the following statements are equivalent: 

(i) A is the infinitesimal generator of a strongly continuous semigroup P(t) such that ||P(i)|| < 
e at for allt>0. 

(ii) A — al is dissipative, that is, Re(u, Au — au) < for all u £ D(A), and the range of A — XL 
is equal X for some A > a. 

Example 24. As a simple example consider the Hilbert space X = L 2 (R n ) with the linear 
operator A : D(A) C X — >• X defined by 

D{A) := {u £ X : (1 + \k\ 2 )Fu £ L 2 (M n )}, 
Au:=Au = -J 7 - 1 {\k\ 2 J : u), u€D(A), 

where T denotes the Fourier-Plancharel operator, see Section [2] Using Parseval's identity, we find 

Re(u,Au) = Rc(Tu,-\k\ 2 Tu) = -\\(\k\Tu)\\ 2 < 0, 



10 See Theorem 4.1.3 in Ref. EH 
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hence A is dissipative. Furthermore, let v G L 2 (R n ), then 

x-i / Fv 
u := J 7 



1 + |fc| 2 

defines an element in D(A) satisfying (/ — A)u = u — Au = v. Therefore, the range of A — I is 
equal to X, and Theorem H implies that A — A generates a strongly continuous semigroup P(t) 
on X such that ||P(i)|| < 1 for all t > 0. The curves u(t) ■- P(t)f, t > 0, / e L 2 (IR™) are the 
weak solutions to the heat equation on M. n , see Section [3.1. 2| 

In general, the requirement for A — al to be dissipative is equivalent to finding an energy 
estimate for the squared norm E :— \\u\\ 2 of u. Indeed, setting u(t) :— P(t)f and using u t = AP(t)f 
we find 

-r--E(t) = ^-||"WH 2 = 2Re(u(t),Au(t)) < 2a\\u(t)\\ 2 = 2aE(t) 

for all i > and / € D(A), which yields the estimate 

\\u(t)\\ <e at \\f\\, t>0, (70) 

for all / € D(A). Given the dissipativity of A — al, the second requirement, that the range of 
A — XI is X for some A > a, is equivalent to demanding that the linear operator A— XI : D(A) — > X 
be invertible. Therefore, proving this condition requires solving the linear equation 

Au — Xu = v 

for given v € X. This condition is important for the existence of solutions, and shows that 
for general evolution problems requiring an energy estimate is not sufficient. This statement is 
rather obvious, because given that A — al is dissipative on D(A), one could just make D(A) 
smaller, and still have an energy estimate. However, if D(A) is too small, the Cauchy problem 
is over-determined and a solution might not exist. We will encounter explicit examples for this 
phenomena in Section [51 when discussing boundary conditions. 

Finding the correct domain D(A) for the infinitesimal generator A is not always a trivial task, 
especially for equations involving singular coefficients. Fortunately, there are weaker versions of the 
Lumer- Phillips theorem which only require checking conditions on a subspace D C D(A) which 
is dense in X. It is also possible to formulate the Lumer-Phillips theorem on Banach spaces. See 
Refs. [3T51 US |SD] for more details. 

The semigroup theory can be generalized to time-dependent operators A(t), and to quasilinear 
equations where A(u) depends on the solution u itself. We refer the reader to Ref. 50 for these 
generalizations and for applications to examples from mathematical physics including general rela- 
tivity. The theory of strongly continuous semigroups has also been used for formulating well posed 
initial-boundary value formulations for the Maxwell equations [340) and the linearized Einstein 
equations [301] with elliptic gauge conditions. 
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4 Initial Value Formulations for Einstein's Equations 

In this section we apply the theory discussed in the previous section to well posed Cauchy formu- 
lations of Einstein's vacuum equations. The first such formulation dates back to the fifties [164] 
and will be discussed in Section |4.1| Since then, there has been a plethora of new formulations 
which distinguish themselves by the choice of variables (metric vs. tetrad, Christoffcl symbols vs. 
connection coefficients, inclusion or not of curvature components as independent variables, etc.), 
the choice of gauges and the use of the constraint equations in order to modify the evolution equa- 
tions off the constraint surface. Many of these new formulations have been motivated by numerical 
calculations which try to solve a given physical problem in a stable way. 

By far the most successful formulations for numerically evolving binary compact objects have 
been the harmonic system, which is based on the original work of Ref . |164j , and the Baumgarte- 
Shapiro-Shibata-Nakamura (BSSN) one 378, 43 . For this reason, we review these two formulations 
in detail in Sections |4.1| and |4.3| respectively. In Section |4.2| we also review the Arnotwitt-Deser- 
Misner (ADM) formulation [25J , which is based on a Hamiltonian approach to general relativity and 
serves as a starting point for many hyperbolic systems, including the BSSN one. A list of references 



for hyperbolic reductions of Einstein's equations not discussed here is given in Section 4.4 



4.1 The harmonic formulation 

We start by discussing the harmonic formulation of Einstein's field equations. Like in the potential 
formulation of electromagnetism, where the Lorentz gauge V ^A^ — allows to cast Maxwell's 
equations into a system of wave equations, it was observed early in Ref. |131j that Einstein's 
equations reduce to a system of wave equations when harmonic coordinates, 

VV/=0, u = 0,1,2,3, (71) 

are used. There are many straightforward generalizations of these gauge conditions; one of them 
is to replace the right-hand side of Equation ( p7T] ) by given source functions H v |177l 1196] . 

In order keep general covariance, we follow [22^] and choose a fixed smooth background metric 
g a p with corresponding Levi-Civita connection V, Christoffel symbols r p a/ 3, and curvature tensor 
R 0fj,v Then, the generalized harmonic gauge condition can be rewritten ag 11 ] 

C := g af3 (r^ Q/3 - r" a/9 ) + fP = 0. (73) 

In the particular case where H^ = and where the background metric is Minkowski in stan- 
dard Cartesian coordinates, r p Q) g vanishes, and the condition C M = reduces to the harmonic 



coordinate expression (71 ). However, unlike the condition (71 1, Equation (73) yields a coordinate- 
independent condition for any given vector field H^ on spacetime since the difference C^ a B := 

T^ a p — T ^ a p between two connections is a tensor field. In terms of the difference, h a p := g a p — g a 8, 
between the dynamical and background metric, this tensor field can be expressed as 

C M a/3 = 2^ (Va V + Vph av - Vuhap) ■ (74) 

Of course, the coordinate-independence is now traded for the introduction of a background metric 
g a g, and the question remains of how to choose g a a and the vector field H M . A simple possibility 
is to choose H^ = and g a a equal to the initial data for the metric, such that hp, v = initially. 



11 Geometrically, this means that the identity map <j> : (M,g) — > (M,g), p \— > p, satisfies the inhomogeneous 
harmonic wave map equation 

V MV M ^+r D ^^^«r=H D , (72) 

where x^ and x are local coordinates on (M,g) and (M,g), respectively. 
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Einstein's field equations in the gauge C^ = are equivalent to the wave system 



gTVuVuhap = 2 g aT g^C a aii C T fi v + 4 C^ v{oi g ?)a C% T g VT ~ 2 g^R ^ (a g p)o 
+ 16ttG n (r al3 - -gapg^TfJ) - 2 V {a H p) , 



(75) 



where T a p is the stress-energy tensor and Gn Newton's constant. This system is subject to the 
harmonic constraint 

= C" = g^g afi (v a h 0v - ~V v hJ) + H". (76) 



4.1.1 Hyperbolicity 



For any given smooth stress-energy tensor T a p , the equations ( 75 ) constitute a quasilinear system 



of ten coupled wave equations for the ten coefficients of the difference metric h a p (or equivalently, 
for the ten components of the dynamical metric g a p) and, therefore, we can apply the results of 
Section k^ to formulate a (local in time) well-posed Cauchy problem for the wave system ( 75 ) with 
initial conditions 



(0), 



h a p(0,x) = ti^pix), 



^(0,*) = *$(*), 



(77) 



where h a l and k a l are two sufficiently smooth symmetric tensor fields defined on the initial slice 
t = satisfying the requirement that g a p(0,x) = g a p(0, x) + h a l has Lorentz signature such that 
g oo (0,x) < and the induced metric gij(0,x), i,j,= 1,2,3, on t = is positive definite Fj For 
detailed well posed Cauchy formulations we refer the reader to the original work in 164 , see also 
|165j , |159j , and Ref . [241] , which presents an improvement of the results in the previous references 
due to weaker smoothness assumptions on the initial data. 



An alternative way of establishing the hyperbolicity of the system ( 75 ) is to cast it into first 



order symmetric hyperbolic form |159[ \TE\ 1278] . There are several ways of constructing such a 
system; the simplest one is obtained |159j by introducing the first partial derivatives of g a p as new 
variables, 

dg a p n _ ,_ dg a p 



k 



a/3 



in 



D 



jaff 



dxi 



J = 1,2,3. 



Then, the second order wave system ( 75 ) can be rewritten in the form 



dg a p 
dt 

8Dj a p 

dt 

dk a p 
dt 



dkg/3 

dxi 
2 g 0j dk a p _ g ij dD ial3 



dxJ 



dxi 



(78) 
(79) 
(80) 



where l.o. are lower order terms not depending on any derivatives of the state vector u — {g a p, k a p, Dj a p). 
The system of equations ( p78J [79} [80| constitutes a quasilinear first order symmetric hyperbolic sys- 
tem for u with symmetrizer given by the quadratic form 



*H{u)u = J2 fe<M + \g 00 \k* ap k al3 + g^DtapDjap) 

a,/3=0 



(81) 



12 As indicated above, given initial data g a l and k a l it is always possible to adjust the background metric g D 



(0), 



,(0), 



such that the initial data for h a p is trivial; just replace g a p(t,x) by g a p(t,x) + ti a i{x) + tk a L{x) 
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However, it should be noted that the symmetrizer is only positive definite if g lJ is; that is, only 
if the time evolution vector field dt is time-like. In many situations, this requirement might be 
too restrictive. Inside a Schwarzschild black hole, for example, the asymptotically time-like Killing 
field d t is space-like. 



However, as indicated above, the first order symmetric hyperbolic reduction ( 78 79 80 1 is 
not unique. A different reduction is based on the variables u = (h a p,Tl a p,<bj a /3), where Tl a p := 
n^V^hap is the derivative of g a p in the direction of the future-directed unit normal n M to the 

time-slices t — const, and $j a p '■= Vjh a p- This yields a first order system which is symmetric 
hyperbolic as long as the t = const slices are space-like, independent of whether or not dt is 
time-like [TBI 15?%]. 

4.1.2 Constraint propagation and damping 

The hyperbolicity results described above guarantee that unique solutions of the nonlinear wave 



system (75) exist, at least for short times, and that they depend continuously on the initial data 



h aB , k a l. However, in order to obtain a solution of Einstein's field equations one has to ensure 



that the harmonic constraint ( 73 ) is identically satisfied 



The system ( 75 1 is equivalent to the modified Einstein equations 



Ra p + V (a C/ 9) = g^^ f Ta p _ Ig^g^T^j , (82) 

where i? Q/3 denotes the Ricci tensor, and where C M = if the harmonic constraint holds. From the 
twice contracted Bianchi identities 2V pR a @ — V Q (g AU/ i?' il/ ) = one obtains the following equation 
for the constraint variable C Q , 

ff""V„V„C a + R a p C p = -16TrG N V p T a P. (83) 

This system describes the propagation of constraint violations which are present if C M is nonzero. 
For this reason, we call it the constraint propagation system, or auxiliary system. Provided 
the stress-energy tensor is divergence free, V^T"' 9 = 0, this is a linear, second order hyperbolic 
equation for C Q pj Therefore, it follows from the uniqueness properties of such hyperbolic problems 
that C a — provided the initial data h a L k a l satisfies the initial constraints 

dC a 
C a (0,x)=0 1 — (0,x)=0. (84) 

This turns out to be equivalent to solving C"*(0, x) — plus the usual Hamiltonian and momentum 
constraints, see Refs. |164|, 1278] . Summarizing, specifying initial data h a l, k a l satisfying the 



constraints ( 84 ) , the corresponding unique solution to the nonlinear wave system ( 75 1 yields a 



solution to the Einstein equations. 



However, in numerical calculations, one cannot assume that the initial constraints (84) are 



satisfied exactly, due to truncation and roundoff errors. The propagation of these errors is described 



by the constraint propagation system ( 83 1 , and hyperbolicity guarantees that for each fixed time 
t > of existence, these errors converge to zero if the initial constraint violation converges to zero, 
which is usually the case when resolution is increased. On the other hand, due to limited computer 

13 Notice that the condition of T a " being divergence-free depends on the metric g a p itself, which is not known 
before actually solving the nonlinear wave equation (|75k, and the latter in turn depends on T a " . Therefore, 
one cannot specify T a P by hand, except in the vacuum case, T a " = 0, or in the case T a " = —Kg a P with A the 
cosmological constant. In the more general case, the stress-energy tensor has to be computed from a diffeomorphism- 
independent action for the matter fields and one has to consistently solve the coupled Einstein-matter system. 
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resources, one cannot reach the limit of infinite resolution, and from a practical point of view one 
does not want the constraint errors to grow rapidly in time for fixed resolution. Therefore, one 
would like to design an evolution scheme in which the constraint violations are damped in time, 
such that the constraint hypersurface is an attractor set in phase space. A general method for 
damping constraints violations in the context of first order symmetric hyperbolic formulations of 
Einstein's field equations was given in Ref. [75]. This method was then adapted to the harmonic 
formulation in Ref. |216j . The procedure proposed in Ref. [216J consists in adding lower order 



friction terms in Equation (82) which damp constraint violations. Explicitly, the modified system 
reads 

K*0 + V^C" 9 ) - k (n {a C^ - hi + p)g aP n ll cA = 8irG N (t^ - h aP g^T^j , (85) 

with n^ the future directed unit normal to the t = const surfaces, and n and p real constants, 
where k > determines the timescale on which the constraint violations C^ are damped. 
With this modification the constraint propagation system reads 

gTVi&vCr + R a pC 13 - kV/3 fon^C® + pg^n^cA = -16nG N V p T a/3 . (86) 

A mode analysis for linear vacuum perturbations of the Minkowski metric reveals 216 that for 
K > and p > —1 all modes except those which are constant in space are damped. Numerical 
codes based on the modified system (|85|) or similar systems have been used in the context of binary 



black hole evolutions gMl IMS ME EZ1 OH ESQ EE3] , the head-on collision of boson stars [3T2] 
and the evolution of black strings in five-dimensional gravity |271j . among other references. 

4.1.3 Geometric issues 

The results described so far guarantee the local in time unique existence of solutions to Einstein's 
equations in harmonic coordinates, given a sufficiently smooth initial data set (h^°\ k^). However, 
since general relativity is a diffeomorphism invariant theory, some questions remain. The first 
issue is whether or not the harmonic gauge is sufficiently general such that any solution of the 
field equations can be obtained by this method, at least for short enough time. The answer is 
affirmative [1641 1159] . Namely, let (M,g), M = (— e,e) x M 3 , be a smooth spacetime satisfying 
Einstein's field equations such that the initial surface t = is spacelike with respect to g. Then, 
we can find a diffeomorphism (j) : M —¥ M in a neighborhood of the initial surface which leaves it 
invariant and casts the metric into the harmonic gauge. For this, one solves the harmonic wave 



map equation ( 72 ) with initial data 



0°(O,ar) = O > ^-(0,x) = l, ^(0,x)=x\ ^(0,x)=0. 



Since equation ( 72 ) is a second order hyperbolic one, a unique solution exists, at least on some 
sufficiently small time interval (— e',e'). Furthermore, choosing e' > small enough, <f> : (—s',e') x 
M 3 — > M describes a diffeomorphism when restricted to its image. By construction, g :— (<f>~ 1 )*g 



satisfies the harmonic gauge condition ( 73 ) 



The next issue is the question about geometric uniqueness. Let g^ 1 and g^ be two solutions of 
Einstein's equations with the same initial data on t = 0, i.e., 5^2(0, x) = g a l{Q,x), d t g a l(0,x) = 

dtg a a(0,x). Are these solutions related, at least for small time, by a diffeomorphism? Again, the 
answer is affirmative 164, 159 because one can transform both solutions to harmonic coordinates 
using the above diffeomorphism <j> without changing their initial data. It then follows by the 



uniqueness property of the nonlinear wave system ( 75 1 that the transformed solutions must be 
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identical, at least on some sufficiently small time interval. Note that this geometric uniqueness 
property also implies that the solutions are, at least locally, independent of the background metric. 
For further results on geometric uniqueness involving only the first and second fundamental forms 
of the initial surface see Ref . [124] , where it is shown that every such initial data set satisfying the 
Hamiltonian and momentum constraints possesses a unique maximal Cauchy development. 

Finally, we mention that results about the nonlinear stability of Minkowski spacetime with 
respect to vacuum and vacuum-scalar perturbations have been established based on the harmonic 
system |2761 1275] , offering an alternative proof to the one of Ref. [126] . 



4.2 The ADM formulation 

In the usual 3+1 decomposition of Einstein's field equations (see, for example |208j . for a through 
discussion of it) one evolves the three metric and the extrinsic curvature (the first and second fun- 
damental forms) relative to a foliation E t of spacetime by spacelike hypersurfaces. The motivation 
for this formulation stems from the Hamiltonian description of general relativity (see, for instance, 
Appendix E in [417] ) where the "</" variables are the three metric 7^ and the associated canonical 
momenta "k 13 (the "p" variables) are related to the extrinsic curvature Kij according to 



n 



= -^(K i * 



'7 



K) 



where 7 = det(7ij) denotes the determinant of the three-metric and K — r ) 13 K^ the trace of the 
extrinsic curvature. 

In York's formulation 432 of the 3+1 decomposed Einstein equations, the evolution equations 
are 



dojij = -2K. 



'J- 



(87) 



d K v 



R 



(3) 



-DiDjd 



KK, 



2K, l K, 



lj 



8ttG 



A' 



;7y(p- 



Here, the operator do is defined as 8q :— a 1 (dt~ £p) with a and /3 1 denoting lapse and shift, 
respectively. It is equal to the Lie derivative along the future-directed unit normal n to the time 

(3) 

slices when acting on covariant tensor fields orthogonal to n. Next, i?,-- and Dj are the Ricci 
tensor and covariant derivative operator belonging to the three metric 7^, and p := n a n°T a p and 
(Tij := Tij are the energy density and the stress tensor as measured by observers moving along the 
future-directed unit normal n to the time slices. Finally, a := 7 tJ 'Ty denotes the trace of the stress 
tensor. The evolution system (87 88) is subject to the Hamiltonian and momentum constraints, 



H 
Mi 



2 h l]R 



(3) 

ij 



K 2 -JC'Kt 



SitGnp, 



■■= LPKi 



DiK = 8ttG n j u 



(89) 
(90) 



where ji := —n°Tip is the flux density. 



4.2.1 Algebraic gauge conditions 



One issue with the evolution equations ( 87 
to the three-metric, 



is the principle part of the Ricci tensor belonging 



R 



(3) 



1 



7 (-dkdua - didj^u + didk^ij + djdkju) + l-o., 



(91) 



which does not define a positive definite operator. This is due to the fact that the linearized Ricci 
tensor is invariant with respect to infinitesimal coordinate transformations 7^ 1— > 7^ + 2dr£j-\ 
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generated by a vector field £ = £, l di. This has the following implications for the evolution equa- 
tions (87|88), assuming for the moment that lapse and shift are fixed, a priori specified functions, 



(3) 



in which case the system is equivalent to the second order system 9o7y = ~~ 2i?)_- + l.o. for the 
three metric. Linearizing and localizing as described in Section [3] one obtains a linear, constant 
coefficient problem of the form (28) which can be brought into first order form via the reduc- 



tion in Fourier space described in Section [3. 1.5| The resulting first order system has the form of 
Equation ( 29 1 with the symbol 



Q(ik)=i\k\J2P J kj 







I 



-a 2 R{k) 



where R{k) is, up to a factor 2, the principal symbol of the Ricci operator, 

R\^)1ij = 7 I klk m ^fij + fejfcj7; m — Kifc;7mj — njfcl'Ymi 

Here, a, (3 l and jij refer to the frozen lapse, shift and three-metric, respectively. According 
to Theorem [2j the problem is well posed if and only there is a uniformly positive and bounded 
symmetrizer h(k) such that h(k)R{k) is symmetric and uniformly positive for k G S 2 . Although 
R(k) is diagonalizable and its eigenvalues are not negative, some of them are zero since R(k)'jij = 
for jij of the form 7^ = 2ku£j-\ with an arbitrary one-form £j, so h(k)R(k) cannot be positive. 

These arguments were used in [300] to show that the evolution system (87 88 1 with fixed lapse 
and shift is weakly but not strongly hyperbolic. 



The results in 300J also analyze modifications 
of the equations for which the lapse is densitized and the Hamiltonian constraint is used to mod- 
ify the trace of Equation (88 1. The conclusion is that such changes cannot make the evolution 



equations (87 88) strongly hyperbolic. Therefore, these equations, with given shift and densitized 



lapse, are not suited for numerical evolutions. Fj 

4.2.2 Dynamical gauge conditions leading to a well-posed formulation 

The results obtained so far often lead to the popular statement "The ADM equations are not 
strongly hyperbolic." . However, consider the possibility of determining the lapse and shift through 
evolution equations. A natural choice, motivated by the discussion in Section |4.1[ is to impose 
the harmonic gauge constraint ( 73 1 . Assuming that the background metric g a a is Minkowski in 
Cartesian coordinates for simplicity, this yields the following equations for the 3+1 decomposed 
variables, 



(dt-ftd^a = -a 2 fK + a 3 H\ 



(dt - fid,)? = -aj ij d ja + a 2 7 V' (d k j 3 i - \d jlM 



a 2 (H l +p l H t 



(92) 
(93) 



with / a constant which is equal to one for the harmonic time coordinate t. Let us analyze the 

(a,P i ,^ i j,K ij ), where 



hyperbolicity of the evolution system (92 93 87 



for generality and later use we do not necessarily assume / 



88} for the fields u 

1 in Equation (|92h 



Since this is a 



mixed first/second order system, we base our analysis on the first order pseudodifferential reduction 
discussed in Section[3.1.5| After linearizing and localizing, we obtain the constant coefficient linear 



14 Weak hyperbolicity of the system | |87||88[ ) with given shift and densitized lapse has also been pointed out in |248l 
based on a reduction which is first order in time and space. However, there are several inequivalent such reductions, 
and so it is not sufficient to show that a particular one is weakly hyperbolic in order to infer that the second order 
in space system 187| 188} is weakly hyperbolic. 
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problem 



{d t -p k d k )a 



-a 2 fK, 



(fit - p k d k )p = - al ij d ja + a W' ( d kl3l - -d, jlkl ) , 



{dt-^dk)^ 

(d t -p k d k )K tJ 



2 lk(l d 3) p k ~2aK, 



1*3 i 



-fi^a + - 7 " (-fi^ 7 y - Mm + 9id^ + dfr-n) 



(94) 
(95) 

(96) 

(97) 



where a, (3 k and 7y refer to the quantities corresponding to a, f3 k , 7y of the background metric 
when frozen at a given point. In order to rewrite this in first order form, we perform a Fourier 
transformation in space and introduce the variables U — (a, bj,£y,py) with 



\k\a/a, 



h := {kl'Yijfr/a, kj := \k\%, py := 2?i£ 



y » 



where |fc| := y^^'fcjfcj and the hatted quantities refer to their Fourier transform. With this, we 

obtain the first order system U t — P(ik)U where the symbol has the form P(ik) — i/3 s k s I + aQ(ik) 
with 

/ L 



Q(ik) 



( a \ 



kj 

V Pv J 



i\k\ 



J? 



±kl 



\ 



Zk^bjj+pij 



where k{ := ki/\k\, k := j^kj, I := 7 lJ 2y, and p := 7 y py. In order to determine the eigenfields 
S(k)~ 1 U such that S(k)~ 1 P(ik)S(k) is diagonal, we decompose 

b t = bki + hi, 

lij = ikikj + 2k(ilj) + (y + —(7 y ~ kikj)l , py = pkikj + 2k(iPj) -\-pij + —(7 y — kikjjp 



into pieces parallel and orthogonal to ki, similar as in Example 15 Then, the problem decouples 
into a tensor sector, involving (Zy,py), into a vector sector, involving (pi, li,pi) and a scalar sector 
involving (a, b, l,p, l',p')- In the tensor sector, we have 



Q(ten.or)( ife ) J iij j = ,|;. 



Py 



which has the eigenvalues ±i|fe| with corresponding eigenfields iy ±Py- In the vector sector, we 
have 



Q 



(•vector) 



(ifc) 



h 

Pi 



i\k\ 



bj + Pj 




(98) 



which is also diagonalizable with eigenvalues 0, ±z|fc| and corresponding eigenfields pj and Ij ± 
(bj +pj)- Finally, in the scalar sector we have 



Q 



(scalar) 



m 



1 a \ 


1 


b 




I 
P 


= i\k\ 


V 




\? J 


\ 



/ Up+p') \ 

-a +1(1 -I') 
2b + p 
2a + V 



P 
V 



J 
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It turns out Q( scalar \ik) is diagonalizable with purely imaginary values if and only if / > and 
/ y^ 1. In this case, the eigenvalues and corresponding eigenfields are ±i|fc|, ±i|A;|, ±i^/f\k\ and 
V ± p 1 , T± (26 + p), a + fl'/(f - 1) ± v7[P + (/ + !)/(/ - 1)p']/2, respectively. A symmetrizer 
for P(ik) which is smooth in k € S 2 , a, (3 k and 7y can be constructed from the eigenfields as in 
Example [15| 
Remarks: 

• If instead of imposing the dynamical shift conditi on ([93]) , (3 is a priori specified, then the 
resulting evolution system, consisting of Equations (92 87 88 1, is weakly hyperbolic for any 



choice of /. Indeed, in that case the symbol (98 1 in the vector sector reduces to the Jordan 
block 



Q(vector)^ 

which cannot be diagonalized. 






= i\k\ 



h 
Pj 



When linearized about Minkowski spacetime, it is possible to classify the characteristic fields 



into physical, constraint-violating and gauge fields, see Ref. 103 . For the system (94 97) the 
physical fields are the ones in the tensor sector, ty ± pij , the constraint-violating ones are pj 
and I' ±p' , and the gauge fields are the remaining characteristic variables. Observe that the 



constraint-violating fields are governed by a strongly hyperbolic system (see also Section 4.2.4 
below), and that in this particular formulation of the ADM equations the gauge fields are 
coupled to the constraint-violating ones. This coupling is one of the properties which make 
it possible to cast the system as a strongly hyperbolic one. 



We conclude that the evolution system (92 93 87 



is strongly hyperbolic if and only 



if / > and / ^ 1. Although the full harmonic gauge condition (73) is excluded from these 
restrictions J^J there is still a large family of evolution equations for the lapse and shift that give 
rise to a strongly hyperbolic problem together with the standard evolution equations (87 88 1 from 
the 3+1 decomposition. 



4.2.3 Elliptic gauge conditions leading to a well-posed formulation 

Rather than fixing the lapse and shift algebraically or dynamically, an alternative which has been 
considered in the literature is to fix them according to elliptic equations. A natural restriction 
on the extrinsic geometry of the time slices £4 is to require that their mean curvature, c = 
—K/3, vanishes or is constant [379] . Taking the trace of Equation (88) and using the Hamiltonian 



? (3) 

1 '.; 



constraint to eliminate the trace of R)*' yields the following equation for the lapse, 



[-D j Dj + K ij Kij + 4ttG n ( P + a)] a = d t K, 



(99) 



which is a second order linear elliptic equation. The operator inside the square parenthesis is 
formally positive if the strong energy condition, p+a > 0, holds, and so it is invertible when defined 
on appropriate function spaces. See also Ref. [197J for generalizations of this condition. Concerning 
the shift, one choice which is motivated by eliminating the "bad" terms in the expression for the 
Ricci tensor, Equation (91), is the spatial harmonic gauge [23 . In terms of a fixed (possibly 
time-dependent) background metric jij on S t , this gauge is defined as (cf. Equation (73)) 



= V k 



r 



(r fc «-r h «) 



j l 3j kl 



D kllj - 7, D jlkl 



(100) 



15 Notice that even when / = 1 the evolution system J92[ |93| |87[ |88| ) is not equivalent to the harmonic system 
discussed in the previous subsection. In the former case, the harmonic constraint is exactly enforced in order to 
obtain evolution equations for the lapse and shift; while in the latter case first derivatives of the harmonic constraint 
are combined with the Hamiltonian and momentum constraints, see Equation 182k. 
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where D is the Levi-Civita connection with respect to 7 and T k ij denote the corresponding 
Christoffel symbols. The main importance of this gauge is that it permits to rewrite the Ricci 
tensor belonging to the three metric in the form 



4? = -\"t kl DkDa l3 + D^Vj) + l.o. 



where D k denotes the covariant derivative with respect to the background metric 7 and where the 
lower order terms "l.o." depend only on 7^ and its first derivatives Dkjij- When V = The 
operator on the right-hand side is second order quasilincar elliptic, and with this, the evolution 



system (87 88 ) has the form of a nonlinear wave equation for the three-metric 7^ . However, the 
coefficients and source terms in this equation still depend on the lapse and shift. For constant mean 



curvature slices the lapse satisfies the elliptic scalar equation ( 99 1 , and with the spatial harmonic 



gauge the shift is determined by the requirement that Equation ( 100 ) is preserved throughout 

evolution, which yields an elliptic vector equation for it. In Ref 

hyperbolic-elliptic system consisting of the evolution equations (|87 

rewritten in elliptic form using the condition V k = 0, the constant mean curvature condition (99 1 



23 it w as shown that the coupled 

(3) 



88 1 with the Ricci tensor R 



and this elliptic equation for /3 l , gives rise to a well posed Cauchy problem in vacuum. Besides 
eliminating the "bad" terms in the Ricci tensor, the spatial harmonic gauge has also other nice 
properties which were exploited in the well-posed formulation of Ref. [23] . For example, the 
covariant Laplacian of a function / is 

D k D k f=^D i D j f-V k D k f, 

which does not contain any derivatives of the three metric j^ if V k = 0. For applications of the 
hyperbolic-elliptic formulation in |23j to the global existence of expanding vacuum cosmologies, 
see Refs. [SUES]. 

Other methods for specifying the shift have been proposed in [379] , with the idea of minimizing 
a functional of the type 

i\p]= /e«%V7d 3 x, (101) 



where Qij :— dtjij/2 = —aKij + Duft(j\ is the strain tensor. Therefore, the functional J[/3] 
minimizes time changes in the three metric in an averaged sense. In particular, I[f3] attains its 
absolute minimum (zero) if dt is a Killing vector field. Therefore, one expects the resulting gauge 
condition to minimize the time dependence of the coordinate components of the three metric. An 



alternative is to replace the strain by its trace-free part on the right-hand side of Equation ( 101 1 



giving rise to the minimal distortion gauge. Both conditions yield a second order elliptic equation 
for the shift vector which has unique solutions provided suitable boundary conditions are specified. 
For generalizations and further results on these type of gauge conditions, see Refs. [711 11971 1198] . 
However, it seems to be currently unknown whether or not these elliptic shift conditions, together 



with the evolution system (87 88) and an appropriate condition on the lapse, lead to a well posed 
Cauchy problem. 



4.2.4 Constraint propagation 



The evolution equations (87 88 ) are equivalent to the components of the Einstein equations cor- 



responding the spatial part of the Ricci tensor, 



Rij — 8itGn 



T,, 



1 



.lij9 1-HV 



(102) 
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and in order to obtain a solution of the full Einstein equations one also needs to solve the constraints 
H = 8ttGnP and Mi = 8ttGnJz- As in the previous subsection, the constraint propagation system 
can be obtained from the twice contracted Bianchi identities which, in the 3+1 decomposition, 
read 

d H + \di (a 2 M,-) - 2KH - (K ij - K^) R tj = 0, (103) 

d Mi + \d, (a 2 H) - KM, + ~D j (etR« - 07«7*'-Rfci) = 0- (104) 

or a 

The condition of the stress-energy tensor being divergence-free leads to similar evolution equations 



for p and ji. Therefore, the equations (102 1 lead to the following symmetric hyperbolic system 



|184U431] for the constraint variables H := H — 8ttGnp and M.i := Mi — 8irGNJi, 

d Q H = -\d j (a 2 M 1 )+2KH, (105) 

doM> = -^D. l (a 2 U)+KM l . (106) 

As has also been observed in [184] , the constraint propagation system associated to the standard 



ADM equations, where Equation ( 102 1 is replaced by its trace-reversed version R i j — j i jg lJ - l/ R^/2 = 
8irGpfTij is 

d n = -^Di (a 2 Mj)+KH, 
or 

d Q M t = -—H + KMi, 
a 

which is only weakly hyperbolic. Therefore, it is much more difficult to control the constraint fields 
in the standard ADM case than in York's formulation of the 3+1 equations. 

4.3 The BSSN formulation 

The BSSN formulation is based on the 3+1 decomposition of Einstein's field equations. Unlike the 
harmonic formulation, which has been motivated by the mathematical structure of the equations 
and the understanding of the Cauchy formulation in general relativity, this system has been mainly 
developed and improved based on its capability of numerically evolving spacetimes containing 
compact objects in a stable way. Interestingly, it turns out that in spite of the fact that the 
BSSN formulation is based on an entirely different motivation, mathematical questions like the 
well posedness of its Cauchy problem can be answered, at least for most gauge conditions. 

In the BSSN formulation, the three metric 7^ and the extrinsic curvature K t j are decomposed 
according to 

lii = e*%j, (107) 

K tJ = e^ULij + pyK). (108) 

Here, K = j l: >Kij and Ay are the trace and the trace-less part, respectively, of the conformally 
rescaled extrinsic curvature. The conformal factor e 2 ^ is determined by the requirement for the 
conformal metric jij to have unit determinant. Aside from these variables one also evolves the 
lapse (a), the shift (/3 l ) and its time derivative (B l ), and the variable 



:= -9,7 
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k'J 



In terms of the operator <9 = d t — ffldj the BSSN evolution equations are 
d Q a = -a 2 f(a,^x^(K~K Q (x^), 



dnK 



-4tp 



D l D t a + 28,(1) ■ D l a 



A lJ A, 



3 



aS, 



d /3 l = a 2 G(a,0,^)B l , 

d Q B l = e- A<t >H(a, fa^dot* -*?{&, a, x") 



do 



= -!* 



^ fe /3 fe , 



9o7« = -2ai4 <i + 2% {i d j) l3 k - pudkP", 



d Aij = 



-A<j> 



aRij + aRf, — DiDja + 4d^<j> ■ Dj)cx 



TF 



2 ~. 



+ aKA l} - 2aA lk A) + 2A k{i d j) K - ^A l3 d k p k - ae" 4 *^, 



2A %J d i a + 2a 



(to - l)5 fe i fel - ^P*lf + m(f\;I w + ei^fy 



(109) 
(110) 

(111) 
(112) 

(113) 
(114) 

(115) 
5*, (116) 



Here, quantities with a tilde refer to the conformal three metric 7y, which is also used in order 
to raise and lower indices. In particular, Di and T k ij denote the covariant derivative and the 
Christoffel symbols, respectively, with respect to 7y. Expressions with a superscript TF refer to 
their trace-less part with respect to the conformal metric. Next, the sum R\j + R\- represents the 
Ricci tensor associated to the physical three metric 7^ , where 

1 



R; 



'f^d k di% + 7 fc (A-)f fe - t [l])k da lk + 7" (2f \it i)ka + f fe is f fey ) , (117) 



Rl = -2AAc 



2j ZJ D k D k + ADrfDrf - A^ fij D k (j>D M 



(118) 



The term 8qT 1 in Equation (112 1 is set equal to the right-hand side of Equation (116 1. The 



parameter m in the latter equation modifies the evolution flow off the constraint surface by adding 
the momentum constraint to the evolution equation for the variable V 1 . This parameter was first 
introduced in Ref. [8] in order to compare the stability properties of the BSSN evolution equations 
with those of the ADM formulation. 



The gauge conditions which are imposed on the lapse and shift in Equations ( 109 



111 



112) 



were introduced in |49) and generalize the Bona-Masso condition |60) and the hyperbolic Gamma 
driver condition [5]. It is assumed that the functions /(a, </>, x' 1 ), G(a,(/),x fi ) and H(a, 4>, x^) are 
strictly positive and smooth in their arguments, and that Kq(x^) and ^(B 3 , a, 2;^) are smooth 
functions of their arguments. The choice 



m = 1, 

G{a,<j),xi") 



/(a, <£, ar* 4 ) 

3 

~ 4«2 ' 



2 
a 



H(a,()),x^) = e 



K Q (xn = 0, 



P-\ — e H 



T] i (B j ,a,x^) = T)B i , 



(119) 
(120) 



with 77 a positive constant, corresponds to the evolution system used in many black hole simulations 
based on 1 + log slicing and the moving puncture technique (see, for instance, Ref. |411j and 
references therein). Finally, the source terms 5, Sly and S l are defined in the following way: 

denoting by R\j and R\j the Ricci tensors belonging to the three- metric 7y and the spacetime 
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metric, respectively, and introducing the constraint variables 



the source terms are defined as 



S := f j R, 



(4) 



2H, 



n := -A i 13 R$ 



-K 2 -A^A ljr 



C l := r + drf', 



(121) 

(122) 
(123) 



Sij :— 



R^+% {l d 3) C k 



TF 



S l :=2amf : >M j -d G C 



For vacuum evolutions one sets S = 0, Sij = and S* = 0. When matter fields are present, the 



Einstein field equations are equivalent to the evolution equations ( 109 116 1 setting S — —4:TtGn(p+ 



a), Sij = &"kGn<tJ; F , S l = 16nGNTna ; y' lk jk and the constraints H = 8ttGnP, Mi = St^GmJi and 



When comparing Cauchy evolutions in different spatial coordinates, it is very convenient to 
reformulate the BSSN system such that it is covariant with respect to spatial coordinate transfor- 
mations. This is indeed possible, see Refs. [77J EI]- One way of achieving this is to fix a smooth 

background three-metric 7^, similarly as in Section 4.1 and to replace the fields <p and T l by the 
scalar and vector fields 



1 
12 



log 



r 



■Dtfi, 



where 7 and 7 denote the determinants of 7^ and 7 y , and Dj is the covariant derivative associated 



to the latter. If 7^ is flafl ib and time-independent, the corresponding BSSN equations are obtained 



by replacing dk ^ Dk and T k ij \-t T k ij — T k ij in Equations (109 116 



117 



118 



121 123) 



4.3.1 The hyperbolicity of the BSSN evolution equations 



In fact, the ADM formulation in the spatial harmonic gauge described in Section |4.2.3| and the 
BSSN formulation are based on some common ideas. In the covariant refor mulation of BSSN just 
mentioned, the variable T 4 is just the quantity V 1 defined in Equation (100), where 7^ is replaced 
by the conformal metric 7 y . Instead of requiring T l to vanish, which would convert the operator 



on the right-hand side of Equation (117) into a quasilinear elliptic operator, one promotes this 



quantity to an independent field satisfying the evolution equation ( |116| ) (see also the discussion 
below Equation (2.18) in Ref. 378 ). In this way, the 7^ — A^-block of the evolution equations 
forms a wave system. However, this system is coupled through its principal terms to the evolution 
equations of the remaining variables, and so one needs to analyze the complete system. As follows 



from the discussion below, it is crucial to add the momentum constraint to Equation (116) with 
an appropriate factor m in order to obtain a hyperbolic system. 

The hyperbolicity of the BSSN evolution equations was first analyzed in a systematic way in 
Ref. [361], where it was established that for fixed shift and densitized lapse, 



Q 



12errf 



(124) 



the evolution system (110 113 116) is strongly hyperbolic for a > and m > 1/4 and symmet- 
ric hyperbolic for m > 1 and 6a — Am — 1. This was shown by introducing new variables and 

16 If -fij is time-independent but not flat, additional curvature terms appear in the equations, see Appendix A in 
Ref. ESI. 
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enlarging the system to a strongly or symmetric hyperbolic first order one. In fact, similar first 
order reductions were obtained in [190, 1182"] . However, in Ref. [361] it was shown that the first 
order enlargements are equivalent to the original system if the extra constraints associated to the 
definition of the new variables are satisfied, and that these extra constraints propagate indepen- 
dently of the BSSN constraints H — 0, Mi = and C l = 0. This establishes the well posedness of 



the Cauchy problem for the system ( 124 110 113 116 1 under the aforementioned conditions on a 
and m. Based on the same method, a symmetric hyperbolic first order enlargement of the evolu- 
tion equations (109 110 113 1161 and fixed shift was obtained in Ref. [15] under the conditions 



/ > and 4m = 3/ + 1 and used to construct boundary conditions for BSSN. First order strongly 



hyperbolic reductions for the full system (109-116) have also been recently analyzed in Ref. [75] . 
An alternative and efficient method for analyzing the system consists in reducing it to a first 
order pseudodifferential system, as described in Section |3.1.5[ This method has been applied 
in |300j to derive a strongly hyperbolic system very similar to BSSN with fixed, densitized lapse 
and fixed shift. This system is then shown to yield a well posed Cauchy problem. In [15J the same 
method was applied to the evolution system (109 -116). Linearizing and localizing, one obtains a 
first order system of the form U t — P(ik)U — if3 s k s U + aQ(ik)U. The eigenvalues of Q(ik) are 0, 
±i, ±iy/m, H-y/Ji, ±\/7; ±VGH, ±\/k, where we have defined \i := (4m — l)/3 and k :— 4GH/3. 



k > 0, 



The system is weakly hyperbolic provided that 

/ > 0, ijl > 0, 



and it is strongly hyperbolic if, in addition, the parameter m and the functions /, G, and H can 
be chosen such that the functions 



/ 



m — 1 

\X — K 



6(ra — 1)k 
4m — 3k 



are bounded and smooth. In particular, this requires that the nominators converge to zero at least 
as fast as the denominators when / — > k, /i — > K or 3k — > 4m, respectively. Since k > 0, the 
boundedness of n/(f — k) requires that / ^ k. For the standard choice m = 1, the conditions 
on the gauge parameters leading to strong hyperbolicity are, therefore, / > 0, n > and f ^= k. 



Unfortunately, for the choice (119 120) used in binary black hole simulations these conditions 
reduce to 

e 4 * ^ 2a. (125) 

which is typically violated at some two-surface, since asymptotically, a —¥ 1 and <j> — > while near 
black holes a is small and (f> positive. It is currently not known whether or not the Cauchy problem 
is well posed if the system is strongly hyperbolic everywhere except at points belonging to a set 
of zero measure, such as a two-surface. Although numerical simulations based on finite difference 



discretizations with the standard choice (119 120) show no apparent sign of instabilities near 



such surfaces, the well posedness for the Cauchy problem for the BSSN system ( 109 116 1 with the 



choice ( 119 120 ) for the gauge source functions remains an open problem when the condition ( 125 ) 



is violated. However, a well posed problem could be formulated by modifying the choice for the 
functions G and H such that / ^ k and /, k > are guaranteed to hold everywhere. 

Yet a different approach to analyzing the hyperbolicity of BSSN has been given in [219, 218 
based on a new definition of strongly and symmetric hyperbolicity for evolution systems which are 
first order in time and second order in space. Based on this definition, it has been verified that the 
BSSN system (124 110 113f 116[ ) is strongly hyperbolic for a > and m > 1/4 and symmetric 
hyperbolic for 6cr = 4m — 1 > 0. (Note that this generalizes the original result in |361j where, in 
addition, m > 1 was required.) The results in [218] also discuss more general 3+1 formulations 
including the one in [300] and construct constraint-preserving boundary conditions. The relation 
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between the different approaches to analyzing hyperbolicity of evolution systems which are first 
order in time and second order in space has been analyzed in [213] . 



Strong hyperbolicity for different versions of the gauge evolution equations ( 109 



111 



1121 



where the normal operator <9 is sometimes replaced by d t , has been analyzed in |214| . See Table I 
in that reference for a comparison between the different versions and the conditions they are subject 
to in order to satisfy strong hyperbolicity. It should be noted that when m — 1 and do is replaced 
by <9 t , additional conditions restricting the magnitude of the shift appear in addition to / > and 



4.3.2 Constraint propagation 



As mentioned above, the BSSN evolution equations (109 116) are only equivalent to Einstein's 
field equation if the constraints 



U:=H- 8irG N p = 0, 



Mi := Mi - 8%G N ji 



0. 



C 8 = 



are satisfied. Using the twice contracted Bianchi identities in their 3+1 decomposed form, Equa- 



tions (103 104), and assuming that the stress-energy tensor is divergence free, it is not difficult to 



show that the equations (109 116) imply the following evolution system for the constraint fields 
P1I2I5]: 



d n = 

d Mj = 
d C l = 



1 



— DUpfM fi- 
at 



2a 



a 
~3 

2amY 3 Mj 



ae-^A^% t d,C k + y KH, 
H) + aKMj + Midjft + D' 



(a[% {t d 3) C k ] TF ), 



(126) 

(127) 
(128) 



This is the constraint propagation system for BSSN, which describes the propagation of constraint 
violations which are usually present in numerical simulations due to truncation and roundoff errors. 
There are at least three reasons for establishing the well posedness of its Cauchy problem. The 



first reason is to show that the unique solution of the system (126 128) with zero initial data is 



the trivial solution. This implies that it is sufficient to solve the constraints at the initial time 
t — 0. Then, any smooth enough solution of the BSSN evolution equations with such data satisfies 
the constraint propagation system with T-L = 0, Mj = and C l = 0, and it follows from the 
uniqueness property of this system that the constraints must hold everywhere and at each time. In 
this way, one obtains a solution to Einstein's field equations. However, in numerical calculations, 
the initial constraints are not exactly satisfied due to numerical errors. This brings us to the second 
reason for having a well posed problem at the level of the constraint propagation system; namely, 
the continuous dependence on the initial data. Indeed, the initial constraint violations give rise 
to constraint violating solutions; but if these violations are governed by a well posed evolution 
system the norm of the constraint violations is controlled by those of the initial violations for 
each fixed time t > 0. In particular, the constraint violations must converge to zero if the initial 
constraint violations do. Since the initial constraint errors go to zero when resolution is increased 
(provided a stable numerical scheme is used to solve the constraints), this guarantees convergence 
to a constraint-satisfying solution^ Finally, the third reason for establishing well posedness for 
the constraint propagation system is the construction of constraint-preserving boundary conditions 
which will be explained in detail in Section [6] 



The hyperbolicity of the constraint propagation system ( 126 - 128 ) has been analyzed in Refs. [2181 
, [7U HO] > and |307] and shown to be reducible to a symmetric hyperbolic first order system for 



17 However, one should mention that this convergence is usually not sufficient for obtaining accurate solutions. If 
the constraint manifold is unstable, small departure from it may grow exponentially in time and even though the 
constraint errors converge to zero they remain large for finite resolutions, see Ref. |248l for an example of this effect. 
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m > 1/4. Furthermore, there are no superluminal characteristic fields if 1/4 < m < 1. Because of 
finite speed of propagation, this means that BSSN with 1/4 < m < 1 (which includes the standard 
choice m = 1) does not possess superluminal constraint-violating modes. This is an important 
property, for it shows that constraint violations that originate inside black hole regions (which 
usually dominate the constraint errors due to high gradients at the punctures or stuffing of the 
black hole singularities in the turducken approach |152ll74"ll80| ) cannot propagate to the exterior 
region. 

In [343] a general result is derived, showing that under a mild assumption on the form of the 
constraints, strong hyperbolicity of the main evolution system implies strong hyperbolicity of the 
constraint propagation system, with the characteristic speeds of the latter being a subset of those 
of the former. The result does not hold in general if "strong" is replaced by "symmetric", since 
there are known examples for which the main evolution system is symmetric hyperbolic while the 
constraint propagation system is only strongly hyperbolic |105j . 

4.4 Other hyperbolic formulations 

There exist many other hyperbolic reductions of Einstein's field equations. In particular, there has 
been a large amount of work on casting the evolution equations into first order symmetric [4] 11771 

[T89i on nm Esi una na na Esa ghh ezq usa ma Era Era ng and strongly hyperbolic na 

EEQ E3 ED 133 QH E3 I335J EH E3 133 [75] form, see [177] ES3 OH [333] for reviews. For systems 
involving wave equations for the extrinsic curvature, see |125l 0] , see also Ref . |412j and Refs. [13 
73, 362, 367, 424 for applications to perturbation theory and the linear stability of solitons and 
hairy black holes. 

Recently, there has also been work deriving strongly or symmetric hyperbolic formulations from 
an action principle [78, 56, 238 . 
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5 Boundary Conditions: The Initial-Boundary Value Prob- 
lem 

In Section[3]we discussed the general Cauchy problem for quasilinear hyperbolic evolution equations 
on the unbounded domain R". However, in the numerical modeling of such problems one is faced 
with the finiteness of computer resources. A common approach for dealing with this problem is to 
truncate the domain via an artificial boundary, thus forming a finite computational domain with 
outer boundary. Absorbing boundary conditions must then be specified at the boundary such that 
the resulting initial-boundary value problem (IBVP) is well posed and such that the amount of 
spurious reflection is minimized. 

Therefore, wc examine in this section quasilinear hyperbolic evolution equations on a finite, 
open domain £ C R™ with C°°-smooth boundary <9£. Let T > 0. We are considering an IBVP of 
the following form, 



ut=T,Al(t,x,u)^u + F(Tt,x,u), xeS, t£[0,T], 



j=i 



u(0,x) = f(x), 
b(t,x,u)u = g(t,x), 



x £ S, 

.xeas, t£[0,T\, 



(129) 

(130) 
(131) 



where u(t,x) £ C m is the state vector, A 1 (t, x, u), . . . ,A n (t,x,u) are complex mxm matrices, 
F(t,x,u) £ C m , and b(t,x,u) is a complex r x m matrix. As before, we assume for simplicity 
that all coefficients belong to the class C%°([0, T] x £ x C m ) of bounded, smooth functions with 
bounded derivatives. The data consists of the initial data / £ C£°(£,C m ) and the boundary data 
S eC£ o ([O,T]x0E > C p ). 

Compared to the initial-value problem discussed in Section [3] the following new issues and 
difficulties appear when boundaries are present: 

• For a smooth solution to exist, the data / and g must satisfy appropriate compatibility 
conditions at the intersection S := {0} x <9£ between the initial and boundary surface 1333,- 
Assuming that u is continuous, for instance, Equations (130 131) imply that g(0,x) — 
6(0, x, f(x))f(x) for all x £ 9S. If u is continuously diffcrcntiable, then taking a time 



derivative of Equation ( 131 ) and using Equations ( 129 130 1 leads to 



gt(0,x) = c(x) 



Of 



J2A j (0,x,f(x)) 7 ^ J (x)+F(0,xJ(x)) 



b t (0,x,f(x))f(x), 



x£dT, 



where c(x) is the complex r x m matrix with coefficients 



c(xY 



b(0,xJ(x)Y 



E 

G=\ 



db' 



du B 



(0,xj(x))f(x) c 



A = l,. 



B = l, 



. . , m. 



Assuming higher regularity of u, one obtains additional compatibility conditions by taking 
further time derivatives of Equation (131). In particular, for an infinitely differentiable 



solution u, one has an infinite family of such compatibility conditions at S, and one must 
make sure that the data /, g satisfies each of them if the solution u is to be reproduced 
by the IBVP. If an exact solution u^ ' of the partial differential equation (129) is known, a 



convenient way of satisfying these conditions is to choose the data such that in a neighborhood 
of S, f and g agree with the corresponding values for u^°\ i.e such that f(x) — u^°\0,x) 
and g(t,x) — b(t, x, u^ (t, x))u^ (t, x) for (t,x) in a neighborhood of S. Depending on the 
problem at hand, this might be too restrictive, however. 
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• The next issue is the question of what class of boundary conditions ( 131 1 leads to a well- 
posed problem. In particular, one would like to know which are the restrictions on the matrix 
b(t, x, u) implying existence of a unique solution, provided the compatibility conditions hold. 
In order to illustrate this issue on a very simple example, consider the advection equation 
Ut = u x on the interval [— 1, 1]. The most general solution has the form u(t, x) = h(t + x) for 
some differentiablc function h : (— l,oo) — > C The function h is determined on the interval 
[— 1, 1] by the initial data alone, and so the initial data alone fixes the solution on the strip 
— 1 — t < x < 1 — t. Therefore, one is not allowed to specify any boundary conditions at 
x = — 1, whereas data must be specified for u at x = 1 in order to uniquely determine the 
function h on the interval (1, oo). 

• Additional difficulties appear when the system has constraints, like in the case of electro- 
magnetism and general relativity. In the previous section we saw in the case of Einstein's 
equations that it is usually sufficient to solve these constraints on an initial Cauchy surface, 
since the Bianchi identities and the evolution equations imply that the constraints propa- 
gate. However, in the presence of boundaries one can only guarantee that the constraints 
remain satisfied inside the future domain of dependence of the initial surface So := {0} x X 
unless the boundary conditions are chosen with care. Methods for constructing constraint- 
preserving boundary conditions which make sure that the constraints propagate correctly 
on the whole spacetime domain [0, T] x X will be discussed in the next section. 



There are two common techniques for analyzing an IBVP. The first, discussed in Section [5TTJ is 
based on the linearization and localization principles, and reduces the problem to linear, constant 
coefficient IBVPs which can be explicitly solved using Fourier transformations, similarly to the case 
without boundaries. This approach, called the Laplace method, is very useful for finding necessary 
conditions for the well posedness of linear, constant coefficient IBVPs. Likely, these conditions 
are also necessary for the quasilinear IBVP since small amplitude high frequency perturbations 
are essentially governed by the corresponding linearized, frozen coefficient problem. Based on the 
Kreiss symmetrizer construction [252] and the theory of pseudo-differential operators, the Laplace 
method also gives sufficient conditions for the linear, variable coefficient problem to be well posed; 
however, the general theory is rather technical. For a discussion and interpretation of this approach 
in terms of wave propagation we refer to [236] . 

The second method, which is discussed in Section [5~2| is based on energy inequalities obtained 
from integration by parts and does not require the use of pseudo-differential operators. It provides 
a class of boundary conditions, called maximal dissipative, which leads to a well posed IBVP. 
Essentially, these boundary conditions specify data to the incoming normal characteristic fields, or 
to an appropriate linear combination of the in- and outgoing normal characteristic fields. Although 



technically less involved than the Laplace one, this method requires the evolution equations ( 129 ) 
to be symmetric hyperbolic in order to be applicable, and it gives sufficient but not necessary 
conditions for well posedness. 



In Section 5.3 we also discuss absorbing boundary conditions, which are designed to minimize 



spurious reflections from the boundary surface. 



5.1 The Laplace method 



Upon linearization and localization, the IBVP (129 130 131) reduces to a linear, constant coeffi- 
cient problem of the following form, 

n 

«t= E^gii u + F o(M), i£S, * > 0, (132) 

u(0,x) = f(x), x e X, (133) 
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bu = g(t,x), 



seffi, t > 0, 



(134) 



where A> = Ai(t$, Xq, u^°\to,Xo)), b = b(to,XQ,u^(ta,XQ)) denote the matrix coefficients cor- 
responding to A 3 {t, x, u) and b(t,x,u) linearized about a solution ir ' and frozen at the point 
Po = (^0i x o)> an d where for generality we include the forcing term F Q (t, x) with components in the 
class C£°([0, 00) x E). Since the freezing process involves a zoom into a very small neighborhood of 
Po, we may replace E by K" for all points po lying inside the domain E. We arc then back into the 



case of the previous section, and we conclude that a necessary condition for the IBVP (129 130 



131 ) to be well posed at u^°\ is that all linearized, frozen coefficient Cauchy problems corresponding 



to po S E are well posed. In particular, the equation ( 132 ) must be strongly hyperbolic 



Now let us consider a point po £ <9E at the boundary. Since 9E is assumed to be smooth, it 
will be mapped to a plane during the freezing process. Therefore, taking points po £ 3E, it is 



sufficient to consider the linear, constant coefficient IBVP (132 133 134) on the half space 



E := {(xi,x 2 ,.--,x n ) £ 



j'l 



>0}, 



say. This is the subject of this subsection. Because we are dealing with a constant coefficient 
problem on the half-space, we can reduce the problem to an ordinary differential boundary problem 
on the interval [0, 00) by employing Fourier transformation in the directions t and y := (x 2 , ■ ■ . , x n ) 
tangential to the boundary. More precisely, we first exponentially damp the function u(t, x) in time 
by defining for 77 > the function 



u v (t,x) 



-^u(t,x) iort>0, xeE, 
for t < 0, x £ E. 



(135) 



We denote by Ur,{^,xi,k) the Fourier transformation of u v (t,xi,y) with respect to the directions 
t, and y tangential to the boundary and define the Laplace-Fourier transformation of u by 



u(s, xi,k) := u n {£,x\,k) 



(2tt)"/ 2 
Then, u satisfies the following boundary value problem, 



•*- <fc -Mt,zi,l/)cftd ,l - 1 i/ J 



V + i£, 



Aj^u = B(s,k)u + F(s,xi,k), x x > 0, 



dx\ 



bu = g{s, k) 



xi = 0, 



(136) 
(137) 



where for notational simplicity we set A :— A 1 and B^ := A 7 , j = 2, . . . , n, and where B(s, k) := 
si — iB 2 k 2 — ... — iB n k n . Here, F(s, x\,k) = Fq(s, x\, k) + f(x\, k) with Fg and / denoting the 
Laplace-Fourier and Fourier transform, respectively, of Fq and /, and g(s, k) is the Laplace-Fourier 
transform of the boundary data g. 

In the following, we assume for simplicity that the boundary matrix A is invertible, and that 



the equation ( 132 ) is strongly hyperbolic. An interesting example with a singular boundary matrix 



is mentioned in Example 26 below. If A can be inverted, then we rewrite Equation ( 136 ) as the 



linear ordinary differential equation 





dxi 



u = M(s,k)u + A 1 F{s,xi,k), x\ > 0, 



(138) 



where M{s,k) := A^ 1 B(s,k). We solve this equation subject to the boundary conditions (137) 
and the requirement that u vanishes as x\ — > 00. For this it is useful to have information about 
the eigenvalues of M(s, k). 
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Lemma 3 ( 252, 253, 222 ). Suppose the equation (132) is strongly hyperbolic and the boundary 
matrix A has q negative and m — q positive eigenvalues. Then, M(s,k) has precisely q eigenvalues 
with negative real part and m — q eigenvalues with positive real part. (The eigenvalues are counted 
according to their algebraic multiplicity.) Furthermore, there is a constant S > such that the 
eigenvalues re of M(s,k) satisfy the estimate 

\Re(K)\>5Re(s), (139) 

for all Re(s) > and k £ R"- 1 . 

Proof. Let Re(s) > 0, (3 £ R and k £ R™" 1 . Then 

M(s, k) - i/3I = A- 1 [si - i(3A - ik. 3 B' } \ = A- 1 [si - P (i/3, ik)} . 



Since the equation (|132) is strongly hyperbolic there is a constant K and matrices S(/3,k) such 



that (see the comments below Definition [2]) 

\S(/3, k)\ + \S(/3, fc)- 1 ! < K, S(/3, fe)- 1 P (^, ik)S(fi, k) = iK((3, k), 

for all (j3, k) £ R™, where A(/3, k) is a real, diagonal matrix. Hence, 

M(s, k) - i(3I = A- l S{(3, k) [si - iA(/3, k)} S(f3, k)~\ 

and since si — iA(j3, k) is diagonal and its diagonal entries have real part greater than or equal to 
Re(s), it follows that 

\lM(s,k)- i pi]- 1 \<\A\\S(t3,k)\\S(P 1 k)- 1 \\[sI-iA(P,k)]- 1 \< 



SRe(s) ' 

with 5 :— (if 2 |A|) _1 . Therefore, the eigenvalues re of M(s, k) must satisfy 

|re-i/3| > SRe(s) 



for all )3 € R. Choosing (3 := Im(re) proves the inequality (139[). Furthermore, since the eigenvalues 



re = re(s, k) can be chosen to be continuous functions of (s, k) [246J, and since for k = 0, M(s, 0) = 

sA _1 , the number of eigenvalues re with positive real part is equal to the number of positive 

eigenvalues of A. □ 

According to this lemma, the Jordan normal form of the matrix M(s, k) has the following form: 

M(s, k) = T{s, k) [D(s, k) + N(s, k)] T(s, k)' 1 , (140) 

with T(s, k) a regular matrix, N(s, k) is nilpotent (N(s, k) m = 0) and 

D(s,k) = diag(rei,...,re g ,re ?+ i,... ,re m ) 

is the diagonal matrix with the eigenvalues of M(s, k), where rei, . . . , re g have negative real part. 
Furthermore, N(s, k) commutes with D(s, k). Transforming to the variable v(s, X, k) := T{s, k)~ l u(s, x, k) 
the boundary value problem (|136 137) simplifies to 



^v=[D(s,k)+N(s,k)]v + T(s,k)- 1 A~ 1 F(s,x 1 ,k), xi > 0, (141) 

bT{s,k)v = g(s,k) aj! = 0. (142) 
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5.1.1 Necessary conditions for well posedness and the Lopatinsky condition 



Having cast the IBVP into the ordinary differential system (141 142 1, we are ready to obtain a 
simple necessary condition for well posedness. For this, we consider the problem for F — and split 
v = (v_, v + ) where t>_ :— (y\, . . . ,v q ) and v+ := (Vq+i, ■ ■ ■ , v m ) are the variables corresponding to 
the eigenvalues of M (s, k) with negative and positive real parts, respectively. Accordingly, we split 

D(sk)-( D -^ k) ° ^ N(sk)-( N - {s ' k) ° 

U[s ' ] -\ D+(s,k) )' ^{ S ,k)-\ Q N+(s,k) 



and bT(s, k) = (b-(s, k), 6+(s, k)). When F = the most general solution of Equation ( 141 1 is 

v-{a,xi,k) = e D -<- s ^ Xl e N - (s ^ Xl <7_{s,k), 
v+(s,xi,k) = e D +^' k)xi e N +^' k ^ Xl a + (s,k), 

with constant vectors (j-(s,k) e C 9 and cr + (s,k) e C m ~ 9 . The expression for v + describes modes 
that grow exponentially in x\ and do not satisfy the required boundary condition at x\ — > oo 



unless cr+{s, k) = 0; hence we set c_|_(s, k) = 0. In view of the boundary conditions (142 1 we then 
obtain the algebraic equation 

&_(s,fc)o-_(s,fc) =g. (143) 

Therefore, a necessary condition for existence and uniqueness is that the r x q matrix 6_(s, k) be 
a square matrix, i.e. r = q, and that 

det(6_(s,fc)) t^O (144) 

for all Re(s) > and k <G M" _1 . Let us make the following observations: 



• The condition ( 144 ) implies that we must specify exactly as many linearly independent 
boundary conditions as there are incoming characteristic fields, since q is the number of 
negative eigenvalues of the boundary matrix A = A 1 . 



• The violation of condition (144) at some (so, ko) with Re(so) > and k € R™ 1 gives rise to 
the simple wave solutions 

u{t,x 1 ,y) = e Sot+tk °- v u{ So ,x 1 ,ko), t > 0, (zi,y)e£, (145) 



whe re u(sq, •, fco) — T(s, k)v(so, ■, fco) € i (0, oo) is a nontrivial solution of the problem ( 141 



142 1 with homogeneous data F = and g = 0. Therefore, an equivalent necessary condi- 
tion for well posedness is that no such simple wave solutions exist. This is known as the 
Lopatinsky condition. 

• If such a simple wave solution exists for some (so, ko), then the homogeneity of the problem 
implies the existence of a whole family, 

u a {t,x 1 ,y) = e a{sot+ik °- y) u(aso,ax 1 ,ak ) 7 t>0, (n,j/)6E, (146) 

of such solutions parametrized by a > 0. In particular, it follows that 

\u a (t, Xl ,y)\ = e aBs ^ t \u(as 0) caB 1) ak )\ = e Jte W*|u a (0 J x 1 ,»)|, 

such that 

I "« (*,si,3/)l = e aRe( s )t ^ ^ 
|"a(0,a;i,2/)| 

for all t > 0, as a — ¥ oo. Therefore, one has solutions growing exponentially in time with an 
arbitrarily large rateF®] 



18 However, it should be noted that these solutions are not square integrable, due to their harmonic dependency 
in y. This problem can be remedied by truncation of u a , see Section 3.2 in I236| . 
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Example 25. Consider the IBVP for the massless Dirac equation in two spatial dimensions (cf. 
Section 8.4.1 in [25511. 



J _J ) u * + ( 1 ) u v> t ~ °' x ^ °' y G M ' " = ( S 

u(0 J s,») = /(a;, tf ) J z>0, t/eK, (148) 

am + bu 2 = g(t, y), f>0, yeR, (149) 



where a and b are two complex constants to be determined. Assuming / = 0, Laplace-Fourier 
transformation leads to the boundary value problem 

u x = M(s, k)u, x>0, M(s, k) = ( . S k ~^ s J (150) 

au\ + bu% = g(s, k), x = 0. (151) 

The eigenvalues and corresponding eigenvectors of the matrix M(s, k) are k± — ±A and e± = 
(ik, s =F A) T , with A := Vs 2 + k 2 , where the root is chosen such that Re(A) > for Re(s) > 0. The 
solution which is square integrable on [0, oo) is the one associated with k_; that is, 

u(s,x,k) = ae~ x e_, 



with a a constant. Introduced into the boundary condition (1511 leads to the condition 

[ika + (s + \)b] a = g(s, k), 

and the Lopatinsky condition is satisfied if and only if the expression inside the square brackets 
on the left-hand side is different from zero for all Re(s) > and k £ M.. Clearly, this implies 6^0 
since otherwise this expression is zero for k = 0. Assuming 6^0 and k ^ 0, we then obtain the 
condition 

z + \J z 1 + l±i- ^ 0, 



for all z :— s/\k\ with Re(z) > 0, which is the case if and only if \a/b\ < 1 or a/b e M, see Figurell] 
The particular case a = 0, b = 1 corresponds to fixing the incoming normal characteristic field u-i 
to g at the boundary. 

Example 26. We consider the Maxwell evolution equations of Example [15] on the half-space 
X\ > 0, and freeze the incoming normal characteristic fields to zero at the boundary. These fields 



are the ones defined in Equation (27 1 which correspond to negative eigenvalues and k — d x r~\ 
hence 

E 1 + ^(W 22 + W 33 )^0, E A + W 1A -(l + a)W Al =0, x x =Q, i A €l, t > 0, (152) 
where A — 2,3 label the coordinates tangential to the boundary, and where we recall that fi — y/a/3, 



assuming that a and /3 have the same sign such that the evolution system (25 26) is strongly 
hyperbolic. In this example, we apply the Lopatinsky condition in order to find necessary conditions 
for the resulting IBVP to be well posed. For simplicity, we assume that /i = y/aj3 — 1, which implies 
that the system is strongly hyperbolic for all values of a ^ 0, but symmetric hyperbolic only if 



-3/2 < a < 0, see Example 15 



^Alternatively, if k = —d x is taken to be the unit outward normal, then the incoming normal characteristic fields 
are the ones with positive characteristic speeds with respect to k. This more geometrical definition will be the one 
taken in Section 15.21 
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Im(z) 




Figure 1: Image of the lines Re(z) — const > under the map 



4L,Z4 



z + Vz 2 + 1. 



In order to analyze the system, it is convenient to introduce the variables U\ := W22 + W33, 
U A := W 1A - (1 + a)W A1 , Z := /3W U - (1 + P/2)Ui, and W^ := Was - SabUx/2 which are 
motivated by the form of the characteristic fields with respect to the direction k — —d\ normal to 
the boundary x\ = 0, see Examp le [TB] With these assumptions and definitions, Laplace-Fourier 
transformation of the system (25 2G) yields 



sEi 
sE A 

sL>i 

sU A 

sZ 

sW A1 

sW AB 



ik A 



= —ad\Ui + 1 
= -d x V A -ik B 



(1 + a)U A + q(2 + a)W M 



W 



BA 



(l + a)W Ai 



aik A 



aZ+(l + a)U! 



dxEx + ^ + a^Ej^ 



-diE A + {l + a)ik A Ei, 



2a 



2a 
= -ik A E x , 

= -ik A E B 



ik A E A , 



5 AB k c E c , 



where we have used = l/o since fi = 1. The last three equations are purely algebraic and can 
be used to eliminate the zero speed fields Z, Wax and W A b from the remaining equations. The 
result is the ordinary differential system 



asUi - (1 + a)ik A E A , 
k\ 2 ' 



d l E l = - 
diUi = - 
OxEa = -sU A + (1 + a)ik A E 



--(2 + a) 
a s 



Ei 



diU A = - 



s + 



E A + {1 + a)- 



1 K,j\K, 



-ik A U A , 



E B — a(l + a)ik A U\ 



:.() 



In order to diagonalize this system, we decompose E A and V ' A into their components parallel and 
orthogonal to k; if k :— k/\k\ and I form and orthonormal basis of the boundary x\ = 0| 20 | then 
these are defined as 



En := k A E Al E x := l A E Al 



V\\ ■= k A U A , 



U x ~ l A U A . 



Then, the system decouples into two blocks, one comprising the transverse quantities (Ex,Ux) 
and the other the quantities (Ex, Ux,Eu,Un). The first block gives 



di 



Ex 
tfx 



s + 



i*-r 



—s 




Ex 
Ux 



and the corresponding solutions with exponential decay at x\ — > oo have the form 



Ex(s,xi,k) 
Ux(s,Xi,k) 



a e 



-Xxi 



(153) 



where a is a complex constant, and where we have defined A := \J s 2 + \k\ 2 with the root chosen 
such that Re(A) > for Re(s) > 0. The second block is 



di 



(Eh. \ 

Ux 



( ° 

-f + (2 + a)^ 
i(l + a)|fc| 



-t(l + a)|&| 



—as 





-ia(l + a)\k\ -s + a(2 + a) 





•l+al 



\ 



|fr| 2 



Ux 
\Un J 



with corresponding decaying solutions 



/ Ei(s,xi,k) \ 
Ui(s,xi,k) 
E\\(s,X!,k) 

\ U\\(s,xuk) ) 



(Tie 



,-Az 



/ i\k\s \ 

-i\k\X 

sX 

\ s 2 -a\k\ 2 J 



a 2 e 



,-Ai 



/ isX \ 

i(s 2 /a-\k\ 2 ) 
\k\s 



(154) 



with complex constants Ci and cr 2 . 

On the other hand, Laplace-Fourier transformation of the boundary conditions (152) leads to 



Ex + aU x =0, E A + U A = 0, xi = 0. 



Introducing into this the solutions (153 154 1 gives 



and 



(s + A)er = 

|fc|(s-aA) sX + s 2 -a\k\ 2 ) 
sA + s 2 -a|fc| 2 \k\(s-aX) 



ox 

(To 



= 0. 



In the first case, since Re(s + A) > Re(s) > 0, we obtain oq = and there are no simple wave 
solutions in the transverse sector. In the second case, the determinant of the system is 

- S 2 [( S + A) 2 -(l + a) 2 |fc| 2 ], 



20 fc is not well-defined if k = 0; however, in this case the scalar block comprising (Ex,U±) decouples from the 
vector block comprising (E a ,Ua) and it is simple to verify that the resulting system does not possess nontrivial 
simple wave solutions. 
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which is different from zero if and only if z + \/z 2 + 1 7^ ±(l + a) for all Re(z) > 0, where z := s/\k\. 
Since a is real this is the case if and only if — 2 < a < 0, see Figure [T] 

We conclude that the strongly hyperbolic evolution system ( 25 261) with a/3 = 1 and incoming 



normal characteristic fields set to zero at the boundary does not give rise to a well posed IBVP 
when a > or a < —2. This excludes the parameter range —3/2 < a < for which the system is 
symmetric hyperbolic. This case is covered by the results in Section [5. 2| below which utilize energy 
estimates and show that symmetric hyperbolic problems with zero incoming normal characteristic 
fields are well posed. 



5.1.2 Sufficient conditions for well posedness and boundary stability 



Next, let us discuss sufficient conditions for the linear, constant coefficient IBVP (132 133| |134J ) 
to be well posed. For this, we first transform the problem to trivial initial data by replacing u(t, x) 
with u{t, x) — e~ t f(x). Then, we obtain the IBVP 



u t =T,^£jU + F{t,x), xeS, t>0, 
i=i 

u(0,x) = 0, x £ E, 

bu = g(t,x) 1 x € <9E, t > 0, 

n 

with F(t,x) = F (t,x) - e _t [/(x) + J2 & ''gfj/O)] and ff(*>a;) replaced by g(t,x) - 

3=1 
applying the Laplace-Fourier transformation to it, one obtains the boundary value problem ( 136 



(155) 

(156) 
(157) 

f 6/(x). By 



1371 which could be solved explicitly, provided the Lopatinsky condition holds. However, in view 



of the generalization to variable coefficients, one would like to have a method which does not rely 
on the explicit representation of the solution in Fourier space. 

In order to formulate the next definition, let ft :— [0, 00) x E be the bulk and T := [0, 00) x <9E 
the boundary surface, and introduce the associated norms || • ||,,,o,o and || • ||,,,o,T defined by 



77,o,n 



\vfi,T 



e- 2rit \u{t,x 1 ,y)\ 2 dtdx 1 d n ' l y^ / \u r ,{t,x)\ 2 dtd n x, 



e- 2r > t \ u (t,0,y)\ 2 dtd n - l y= / \u v {t,Q,y)\ 2 dtd n - l y, 



T 



where we have used the definition of u v as in Equation (135 1. Using Parseval's identities we may 
also rewrite these norms as 



MI?j,o,n 



\ri,0,T 




\u(r] + i£,x ll k)\ 2 d n ~ i k dx x 



df, 



\u(r] + i^0,k)\ 2 d n - 1 k | d$. 
The relevant concept of well posedness is the following one: 



(158) 



(159) 



Definition 6. 1252$ The IBVP (155. 156 157) is called strongly well posed in the generalized 
sense if there is a constant K > such that each compatible data F € Cq°(Q) and g € C^°(T) 
gives rise to a unique solution u satisfying the estimate 



V NI*,o,n + IMI'at < k 2 Qll^Cn + \\g\ti, 



a.T 



(160) 
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for all r\ > 0. 

The inequality ( 160 1 implies that both the bulk norm 



>j,0,O 



and the boundary norm 



\v,o,T 



of u arc bounded by the corresponding norms of F and g. For a trivial source term, F = 0, the 



inequality (1601 implies, in particular, 

IMI„,o,T < K\\g\\ n , , T , r, > 0, (161) 

which is an estimate for the solution at the boundary in terms of the norm of the boundary data 



g. In view of Equation (159) this is equivalent to the following requirement: 



Definition 7. \253l \26(J^ The boundary problem {TE?k 156 151) is called boun dary sta ble if 

there is a constant K > such that all solutions u(s,-,k) £ L 2 (0,oo) of Equations (136 131) with 
F = satisfy 

\u(s,0,k)\<K\g(s,k)\ (162) 

-l 



for all Re(s) > and k £ 

Since boundary stability only requires considering solutions for trivial source terms, F = 0, it 
is a much simpler condition than Equation (160 1. Clearly, well posedness in the generalized sense 



implies boundary stability. The main result is that, modulo technical assumptions, the converse is 
also true: boundary stability implies strong well posedness in the generalized sense. 



Consider the linear, constant coefficient IBVP (155 156, 157) on the 
{{x\,X2, ■ ■ ■ ,x n ) £ R™ : x\ > 0}. Assume that equation (155) is strictly hyperbolic, 



Theorem 5. 

half space £ = 

meaning that the eigenvalues of the principal symbol Po{ik) are distinct for all k £ S 1 ™ -1 . Assume 
that the boundary matrix A — A 1 is invertible. Then, the problem is strongly well posed in the 
generalized sense if and only if it is boundary stable. 

Maybe the importance of Theorem [5] is not so much its statement, which concerns only the 
linear, constant coefficient case for which the solutions can also be constructed explicitly, but rather 
the method for its proof, which is based on the construction of a smooth symmetrizer symbol, and 
which is amendable to generalizations to the variable coefficient case using pseudo-differential 
operators. 

In order to formulate the result of this construction, define p :— \J\s\ 2 + \k\ 2 , s' :— s/p, 
k' := k/p, such that (s 1 , k') £ S r + lies on the half sphere S^ := {(s' , k') £ C x 1" : \s'\ 2 + \k'\ 2 = 
l,Re(s') > 0} for Re(s) > and k £ R"- 1 . Then, we have, 



Theorem 6. 1252$ Consider the linear, constant coefficient IBVP (155, 156 151) on the half 



space S. Assume that equation (155) is strictly hyperbolic, that the boundary matrix A = A is 



invertible, and that the problem is boundary stable. Then, there exists a family of complex m x m 
matrices H(s',k), (s',k) £ S+, whose coefficients belong to the class C°°(5'?), with the following 
properties: 

(i) H{s',k') = H(s',k')* is Hermitian. 

(ii) H(s',k')M(s',k')+M(s',k')*H{s',k') >2Re(s')I for all(s',k') £ 5!f. 

(Hi) There is a constant C > such that 

u*H{s',k')u + C\bu\ 2 > \u\ 2 

for all u£C m and all (s', k') £ S+ . 

Furthermore, H can be chosen to be a smooth function of the matrix coefficients of A 3 and b. 



.V) 



Let us show how the existence of the symmetrizer H(s',k') implies the estimate (1601. First 



F 



using Equation ( 138 1 and properties (i) and (ii) we have 

= pu* [H(s\ k')M(s', k') + M(s', k')*H(s', k')] u + 2Re (u*H(s', k')A~ l 
> 2Re(s)\u\ 2 - d\u\ 2 - ^r\H{s', k')A^F\ 2 , 

where we have used the fact that M(s, k) = pM(s', k') in the second step, and the inequality 
2Re(a*b) < 2|a||fr| < Ci|a| 2 + Cf 1 ^! 2 for complex numbers a and b and any positive constant 
C\ > in the third step. Integrating both sides from x\ = to oo and choosing C\ = Re(s), we 
obtain, using (iii), 



Re(s) J \u\ 2 d Xl < ~[u*H{s\k')u] xi=0 + ^- J \HA- 1 F\ 2 dx 1 
o 

1 




oo 



< 



lul 2 l 



cci=0 



C\~gY 



Re(s) 



\HA-'F\ z dxi. 



(163) 



Since H is bounded, there exists a constant C2 > such that \HA 1 F\ < C^F] for all (s', k 1 ) e S" 
Integrating over ^ = Im(s) £ R and k £ W 1 ^ 1 and using Parseval's identity, we obtain from this 



C 2 



V \\u\\l ,n + h\\lo,T < -r\\ F K,ofi + C \\9\\nfi,T> 



and the estimate (160 1 follows with K 2 :— max{C|,C}. 



Example 27. Let us go back to the Example [25] of the 2D Dirac equation on the halfspace with 
boundary condition (149) at x = 0. The solution of Equations ( 150|151 1 at the boundary is given 
by u(s, 0, k) = a(ik, s + X) T , where A = y/s 2 + k 2 , and 

_ g(s,k) 

ika + (s + X)b' 
Therefore, the IBVP is boundary stable if and only if there exists a constant K > such that 

y/k* + \ S + \\ 2 



\ika + (s + X)b\ 



< K 



for all Re(s) > and k £ E™ 1 . We may assume b ^ 0, otherwise the Lopatinsky condition is 
violated. For k = the left-hand side is 1/|6|. For fc^Owe can rewrite the condition as 



1 yi + i^)i ; 

\b\ \il>(z)±i%\ 



<K, 



for all Re(z) > 0, where ip{z) := z + V ' z 2 + 1 and z := s/\k\. This is satisfied if and only if the 
function \ip(z) ± if | is bounded away from zero, which is the case if and only if \a/b\ < 1, see 
Figure [T] 



This, together with the results obtained in Example 25 yields the following conclusions: the 
IBVP ( |147|148|149| ) gives rise to an ill posed problem if b = or if \a/b\ > 1 and a/b <£ R and to 
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a problem which is strongly well posed in the generalized sense if b ^ and \a/b\ < 1. The case 



M = H 7^ i s covered by the energy method discussed in Section 5.2 below. For the case \a/b\ > 1 
with a/b e E see Section 10.5 in [2"22"j . 

Before discussing second order systems, let us make a few remarks concerning Theorem [5] 



The boundary stability condition (|162) is often called the Kreiss condition. Provided the 



eigenvalues of the matrix M(s, k) are suitably normalized it can be shown 252, 222, 236 that 



the determinant det(6_(s, k)) in Equation (144) can be extended to a continuous function 
defined for all Re(s) > and k £ E™" 1 , and condition (162) can be restated as the following 
algebraic condition: 

det(&_(s,fc)) t^O (164) 

for all Re(s) > and k £ W™ . This is a strengthened version of the Lopatinsky condition, 
since it requires the determinant to be different from zero also for s on the imaginary axis. 

As anticipated above, the importance of the symmetrizer construction in Theorem [6] relies 
on the fact that, based on the theory of pseudo-differential operators, it can be used to treat 
the linear, variable coefficient IBVP 252 . Therefore, the localization principle holds: if all 
the frozen coefficient IBVPs are boundary stable and satisfy the assumptions of Theorem [5] 
then the variable coefficient problem is strongly well posed in the generalized sense. 

If the problem is boundary stable, it is also possible to estimate higher order derivatives of 
the solutions. For example, if we multiply both sides of the inequality (163) by \k\ 2 , integrate 



over £ = Im(s) and k and use Parseval's identity as before, we obtain the estimate ( 160 ) with 



u, F and g replaced by their tangential derivatives u y , F y and g v , respectively. Similarly, one 



obtains the estimate (160) with u, F and g replaced by their time derivatives Ut, F t and g t 



if we multiply both sides of the inequality (163) by \s\ 2 and assume that u t (0,x) = for all 
£ X|^j Then, a similar estimate follows for the partial derivative, d\u, in the iri-direction 



x 



using the evolution equation ( 132 ) and the fact that the boundary matrix A 1 is invertible. 



Estimates for higher order derivatives of u follow by an analogous process. 

Theorem [5] assumes that the initial data / is trivial, which is not an important restriction 
since one can always achieve / = by transforming the source term F and the boundary data 



g, as described below Equation (157). Since the transformed F involves derivatives of /, this 



means that derivatives of / would appear on the right-hand side of the inequality ( 160 1, and 
at first sight it looks like one "loses a derivative" in the sense that one needs to control the 
derivatives of / to one degree higher than the ones of u. However, the results in Refs. [33011331] 
improve the statement of Theorem [5] by allowing nontrivial initial data and by showing that 
the same hypotheses lead to a stronger concept of well posedness (strong well posedness, 
defined below in Definition [9] as opposed to strong well posedness in the generalized sense). 

The results mentioned so far assume strict hyperbolicity and an invertible boundary matrix, 
which are too restrictive conditions for many applications. Unfortunately, there does not 
seem to exist a general theory which removes these two assumptions. Partial results include 
Ref. [B], which treats strongly hyperbolic problems with an invertible boundary matrix that 
are not necessarily strictly hyperbolic, and Ref. [285] . which discusses symmetric hyperbolic 
problems with a singular boundary matrix. 



'One can always assume that u(0, x) = «t(0, x) = for all x 6 S by a suitable redefinition of u, F and g. 
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5.1.3 Second order systems 



It has been shown in Ref. [260 that certain systems of wave problems can be reformulated in such 
a way that they satisfy the hypotheses of Theorem [6] In order to illustrate this, we consider the 
IBVP for the wave equation on the half-space E := {{xi, X2, ■ ■ ■ , x n ) £ K" : x\ > 0}, n > 1, 



v tt = Av + F(t,x), xeT,, t>0, 
v(0,x) = 0, v t (0,x) = 0, ieS, 

Lv = g(t,x), igffi, t > 0, 



(165) 
(166) 
(167) 



where F g C^°([0, oo) x E) and g £ Cg°([0, oo) x <9E), and where L is a first order linear differential 
operator of the form 

d 



L-=a--b— 
dt dxi 



E c ^ 



dxj 



where a, b, C2, 



3=2 -*3 

, c n are real constants. We ask under which conditions on these constants the 



IBVP (165 166 167) is strongly well posed in the generalized sense. Since we are dealing with a 
second order system, the estimate (1601 in Definition [6] has to be replaced with 



where the norms 



V lK,i,n + Nftvr < K 2 Qll^Co + IMI*,o,r) . 
"i x q and || • ||? ! 7- control the first partial derivatives of v, 

dv 



(168) 



||," 2 



77,1,0 



\vA,r 



,-2vt 



,-277* 



T 



E 

n 

E 

fi=0 



dx^ 

dv 
dx^ 



(t,xi,y) 



(t,o,y) 



dtdxitf 1 y, 



dtdT-iy, 



with (x^) = (t, x\, X2, ■ ■ ■ , x n ). Likewise, the inequality ( 162 ) in the definition of boundary stability 
needs to be replaced by 

\u(s,0,k)\<K |g(5 ' fc)l . (169) 

vw+w 

Laplace-Fourier transformation of Equations ( 165 167[ ) leads to the second order differential 
problem 



dx\ 



v = (s 2 + \k\ 2 )i- F, xi > 0, 



b— — v = (as — ic(k))v — g, x\= 0, 
ox 1 



(170) 
(171) 



where we have defined c(k) := ^ Cjkj and where F and g denote the Laplace-Fourier transforma- 

3=2 

tions of F and g, respectively. In order to apply the theory described in the previous subsection, 
we rewrite this system in first order pseudo-differential form. Defining 



pv 

dv 
dxi 




F 



02 



where p := \f\s\ 2 + |fc| 2 , we find 

d 



. ;b u = Af («,*)« + /, Sl >0, (172) 

L(s,k)u — g, x\ = 0, (173) 



where we have defined 



M(s, fc):=p( s , 2 + | fc ,|2 ), L(s,fc) :=(as'-ic(fe'))-&) 



with s' := s/p, k' := fc/p. This system has the same form as the one described by Equations (138 



137), and the eigenvalues of the matrix M(s, k) are distinct for Re(s) > and k £ R n_1 . Therefore, 
we can construct a symmetrizer H(s',k') according to Theorem p^ provided that the problem is 
boundary stable. In orde r to check boundary stability, we diagonalize M(s, k) and consider the 



solution of Equation (172) for / = which decays exponentially as x\ —¥ oo, 



u(s,xi,k) = cr_e Xxi ( 
where u_ is a complex constant and A := */ ' s 2 + \k\ 2 with the root chosen such that Re(A) > for 



Re(s) > 0. Introduced into the boundary condition (173) this gives 

[as' + bX' - ic(k')} <r_ = -, 

and the system is boundary stable if and only if the expression inside the square parenthesis is 
different from zero for all Re(s') > and k! £ R™ -1 with \s'\ 2 + \k'\ 2 = 1. In the one-dimensional 
case, n = 1, this condition reduces to (a + b)s' — with \s'\ = 1, and the system is boundary stable 
if and only if a + b =/= 0, that is, if and only if the boundary vector field L is not proportional to 
the ingoing null vector at the boundary surface, 

dt dx\ 

Indeed, if a + b = 0, Lu = a(u t + u Xl ) is proportional to the outgoing characteristic field, for which 
it is not permitted to specify boundary data since it is completely determined by the initial data. 
When n > 2 it follows that b must be different from zero since otherwise the square parenthesis 
is zero for purely imaginary s' satisfying as' — ic(k'). Therefore, one can choose 6=1 without loss 
of generality. It can then be shown that the system is boundary stable if and only if a > and 

n 

Y^, \cj\ 2 < a 2 , see Ref. [260 , which is equivalent to the condition that the boundary vector field L 

i=2 

is pointing outside the domain, and that its orthogonal projection onto the boundary surface 7", 

■ a dt 2^ c i dx > 

is future-directed time-like. This includes as a particular case the "Sommerfeld" boundary condi- 
tion ut — u Xl = for which L is the null vector obtained from the sum of the time evolution vector 
field d t and the normal derivative N = — d Xl . While N is uniquely determined by the boundary 
surface T, dt is not unique since one can transform it to an arbitrary future-directed time-like 
vector field T which is tangent to T by means of an appropriate Lorentz transformation. Since the 
wave equation is Lorentz- invariant, it is clear that the new boundary vector field L = T + N must 



(>:-} 



also give rise to a well-posed IBVP, which explains why there is so much freedom in the choice of 
L, 

For a more geometric derivation of these results based on estimates derived from the stress- 
energy tensor associated to the scalar field v, which shows that the above construction for L is 
sufficient for strong well-posedness, see Appendix B in Ref. |256j . For a generalization to the shifted 
wave equation, see Ref. |357j . 



As pointed out in [260J, the advantage of obtaining a strong well-posedness estimate (168) for 
the scalar wave problem is the fact that it allows the treatment of systems of wave equations where 
the boundary conditions can be coupled in a certain way through terms involving first derivatives 
of the fields. In order to illustrate this with a simple example, consider a system of two wave 
equations, 

s)„= A U) + (^)< « s > <- 

which is coupled through the boundary conditions 

dt dxi) \v 2 J \ v 2 J \ g 2 {t,x) ) 

where N has the form 



X J ' dt dx\ dx n ' 

with (X ^ 1 , . . . ,X n ) 6 C ,l+1 any vector. Since the wave equation and boundary condition for 



i>i decouples from the one for v 2 , we can apply the estimate (168) to v\, obtaining 



flMfti.n + IH&i.r <K 2 Qll^ll",o,n + ll5i||',o,r) • (175) 

If we set 53 (i, x) :— g 2 (t, %) + Xvi(t, x), t > 0, x € <9£, we have a similar estimate for v 2 , 

V ||t*||* lf0 + ||«2||^i ir < K 2 Q||F 2 ||= i0in + ||g 3 ||^o,r) • (176) 



However, since the boundary norm of v\ is controlled by the estimate (175), one also controls 



IMI*,o,r < allflall'.o.r + C 2 |H|^, r < ^^ll^ill^ + {CKf\\ gi \\l Q , T + 2\\g 2 \\l 0tT 



with some constant C > depending only on the vector field X. Therefore, the inequali- 
ties (|175|176) together yield an estimate of the form (168) for v = (vi,v 2 ), F = (Fi,F 2 ) and 



g = (51,52), which shows strong well posedness in the generalized sense for the coupled system. 
Notice that the key point which allows the coupling of v\ and v 2 through the boundary matrix 



operator N is the fact that one controls the boundary norm of v\ in the estimate ( 175 ). The result 
can be generalized to larger systems of wave equations, where the matrix operator N is in trian- 
gular form with zero on the diagonal, or where it can be brought into this form by an appropriate 
transformation [26011257] . 

Example 28. As an application of the theory for systems of wave equations which are coupled 
through the boundary conditions, we discuss Maxwell's equations in their potential formulation on 
the half space £ [260J . In the Lorentz gauge and the absence of sources, this system is described 
by four wave equations d^d^A u = for the components (A t ,A x ,A y ,A z ) of the vector potential 



6-1 



0, where we use the Einstein summation 



A„, which are subject to the constraint C :— d^A^ 
convention. 

As a consequence of the wave equation for A v , the constraint variable C also satisfies the wave 
equation, d^d^C = 0. Therefore, the constraint is correctly propagated if the initial data is chosen 
such that C and its first time derivative vanish, and if C is set to zero at the boundary. Setting 
C = at the boundary amounts in the following condition for A v at x = 0: 



dt 



dA x dA v 
dx dy 



dA z 
dz 



(A t + A x ) = - - 



which can be rewritten as 

d__d_ 
dt dx 

Together with the boundary conditions 

d_ d_ 
dt dx 
d_ 
dt 
d_ 
dt 



d_ 
dt 



dx 



{A t - A x 



d_ 
dx 
d^ 

dx 



A, 



d 
(A t -A x ) + 2~A 

-- 0, 
d 



2°-A z . 
dz 



(177) 



A z = 



dy 

d_ 

dz 



(A t - A x ) 
(A t -A x ), 



this yields a system of the form of Equation (174) with N having the required triangular form, 
where v is the four component vector function v — (A t — A x ,A y , A z ,A t + A x ). Notice that the 
Sommerfeld-like boundary conditions on A y and A z set the gauge- invariant quantities E y + B z and 
E z — By to zero, where E and B are the electric and magnetic fields, which is compatible with an 
outgoing plane wave traveling in the normal direction to the boundary. 

For a recent development based on the Laplace method which allows the treatment of second 
order IBVPs with more general classes of boundary conditions, including those admitting boundary 
phenomena like glancing and surface waves, sec [255 1 . 

5.2 Maximal dissipative boundary conditions 

An alternative technique for specifying boundary conditions which does not require Laplace- Fourier 
transformation and the use of pseudo-differential operators when generalizing to variable coeffi- 
cients is based on energy estimates. In order to understand this, we go back to Section 3.2. 3[ where 
we discussed such estimates for linear, first order symmetric hyperbolic evolution equations with 



symmetrizer H(t, x). We obtained the estimate (55 1, bounding the energy E(E t ) = J s J°(t, x)d n x 



at any time t £ [0, T] in terms of the initial energy E(Eq), provided that the flux integral 
e^J^(t, x)dS, J M (i, x) := -u(t, x)*H(t, x)A f "(t, x)u(t, x) 



T 

was nonnegative. Here, the boundary surface is T — [0,T] x 9S, and its unit outward normal 
e = (0, si, . . . , s n ) is determined by the unit outward normal s to 9S. Therefore, the integral is 
nonnegative if 

u(t, x)*H(t, x)P Q (t, x, s)u(t, x) < 0, (*, x) e T, (178) 

n 

where Po(t,x,s) — ^2 A 3 (t 1 x)sj is the principal symbol in the direction of the unit normal s. 
Hence, the idea is to specify homogeneous boundary conditions, b(t, x)u = at T, such that the 



G5 



condition ( 178 1 is satisfiedF^ In this case, one obtains an a priori energy estimate as in Section 3.2.3 



Of course, there are many possible choices for b(t, x) which fulfill the condition ( 178 ); however, an 



additional requirement is that one should not overdetermine the IBVP. For example, setting all 
the components of u to zero at the boundary does not lead to a well posed problem if there are 
outgoing modes, as discussed in Section [5. 1.1| for the constant coefficient case. Correct boundary 
conditions turn out to be a minimal condition on u for which the inequality ( 178 1 holds. In other 



words, at the boundary surface, u has to be restricted to a space for which Equation (1781 holds 
and which cannot be extended. The precise definition which captures this idea is: 

Definition 8. Denote for each boundary point p = (t, x) £ T the boundary space 

V p := {u E C m : b(t, x)u = 0} C C m 

of state vectors satisfying the homogeneous boundary condition. V p is called maximal nonpositive 
if 

(i) u* H(t, x)Po(t 1 x, s)u < for all u G V p , 

(ii) V p is maximal with respect to condition (i); that is, if W p D V p is a linear subspace of C m 
containing V p which satisfies (i), then W p = V p . 

The boundary condition b(t, x)u — g(t, x) is called maximal dissipative if the associated boundary 
spaces V p are maximal nonpositive for all peT. 

Maximal dissipative boundary conditions were proposed in Refs. [183, 267 in the context of 
symmetric positive operators, which include symmetric hyperbolic operators as a special case. 
With such boundary conditions, the IBVP is well posed in the following sense: 

Definition 9. Consider the linearized version of the IBVP ( 129\ 130\ 131), where the matrix 
functions A°{t,x) and b(t,x) and the vector function F(t,x) do not depend on u. It is called 
well posed if there are constants K = K{T) and e = s(T) > such that each compatible data 
f € C£°(£,C m ) and g <E C£°([0,T] x <9£,C) gives rise to a unique C°° -solution u satisfying the 
estimate 



\u(t,-)f 



£=(£) 



\u{s,-)\\h(dj:) ds < K 2 



I/IIl 2 (e) 



(\\F(s,-)\\ 



2 
L=(S) 



\9(s,-)\\h( ds) ) (!■- 



(179) 

for all t € [0, T] . If, in addition, the constant e > can be chosen strictly positive, the problem is 
called strongly well posed. 

This definition strengthens the corresponding definition in the Laplace analysis, where trivial 
initial data was assumed and only a time-integral of the L 2 (S)-norm of the solution could be 
estimated (see Definition [6]) . The main result of the theory of maximal dissipative boundary 
conditions is: 



Theorem 7. Consider the linearized version of the IBVP (129 ISO, 131), where the matrix 
functions A J (t,x) and b(t,x) and the vector function F(t,x) do not depend on u. Suppose the 
system is symmetric hyperbolic, and that the boundary conditions (131) are maximal dissipative. 
Suppose, furthermore, that the rank of the boundary matrix Po(t,x,s) is constant in (t,x) G T . 

Then, the problem is well posed in the sense of Definition [M Furthermore, it is strongly well 
posed if the boundary matrix Po(t,x,s) is invertible. 



22 The restriction to homogeneous boundary conditions g 
can always be achieved by redefining u, f and F. 



in the IBVP dl29 130 13ll is not severe, since it 
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This theorem was first proven in [1831 12671 [333 for the case where the boundary surface T 
is non-characteristic, that is, the boundary matrix Po(t, x, s) is invertible for all (t, x) € T . A 
difficulty with the characteristic case is the loss of derivatives of u in the normal direction to 
the boundary (see Ref. [410] ). This case was studied in [2851 13321 1375] . culminating with the 
regularity theorem in [375J which is based on special function spaces which control the L 2 -norms 
of Ik tangential derivatives and k normal derivatives at the boundary (see also Ref. [377] ). For 
generalizations of Theorem [7] to the quasilinear case, see Refs. [2121 1376] . 

A more practical way of characterizing maximal dissipative boundary conditions is the following. 
Fix a boundary point (t, x) G T, and define the scalar product (•, •) by (u,v) := u*H(t, x)v, 
u, v € C m . Since the boundary matrix Po(t, x, s) is Hcrmitian with respect to this scalar product, 
there exists a basis ei,e%, . .., e m of eigenvectors of Po(t, x, s) which are orthonormal with respect 
to (•, ■). Let Ai, A2, . . . , X m be the corresponding eigenvalues, where we might assume that the first 
r of these eigenvalues are strictly positive, and the last s are strictly negative. We can expand any 

m 

vector u € C m as u = Y^ vP^Gj, the coefficients u^ being the characteristic fields with associated 



speeds Xj. Then, the condition (178) at the point p can be written as 



o>(u,p (t,x, s )u) = Y,^M f) \ 2 = J2 x ^ uU) \ 2 - E i^-h^i 2 ' ( 18 °) 

j=l j=l j=m—s+l 

where we have used the fact that Ai, . . . , A r > 0, A m _ s +i, . . . , A m < and the remaining A/s are 
zero. Therefore, a maximal dissipative boundary condition must have the form 




u+ = qu-, u+:= ... , u- := ... , (181) 

V «« J V u{m) J 

with q a complex r x s matrix, since u_ = must imply u + = 0. Furthermore, the matrix q has to 




be small enough such that the inequality (180 1 holds. There can be no further conditions since an 



additional, independent condition on u would violate the maximality of the boundary space V p 



In conclusion, a maximal dissipative boundary condition must have the form of Equation ( 181 ) 
which describes a linear coupling of the outgoing characteristic fields u_ to the incoming ones, u + . 
In particular, there are exactly as many independent boundary conditions as there are incoming 
fields, in agreement with the Laplace analysis in Section [5.1. 1| Furthermore, the boundary condi- 
tions must not involve the zero speed fields. The simplest choice for q is the trivial one, q = 0, in 
which case data for the incoming fields is specified. A nonzero value of q would be chosen if the 
boundary is to incorporate some reflecting properties, like the case of a perfect conducting surface 
in electromagnet ism, for example. 

Example 29. Consider the first order reformulation of the Klein-Gordon equation for the 



variables u = ($, <fr t , $>x, $y)> see Example 13 Suppose the spatial domain is x > 0, with the 
boundary located at x — 0. Then, s — (— 1, 0) and the boundary matrix is 



Po(s) = - 



/ \ 

10 

10 

\ / 



Therefore, the characteristic fields and speeds are $, $ y (zero speed fields, A = 0), $t — Q x 
(incoming field with speed A = 1) and $ 4 + & x (outgoing field with speed A = — 1). It follows from 



Equations (180 181) that the class of maximal dissipative boundary conditions is 
(*t-*x) = ?(t,iO(*t + *»)+ff(*,iO, *>0, yeR, 
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where the function q satisfies \q(t, y)\ < 1 and g is smooth boundary data. Particular cases are: 

• q = 0: Sommcrfeld boundary condition, 

• q = — 1: Dirichlet boundary condition, 

• q = 1: Neumann boundary condition. 

Example 30. For Maxwell's equations on a domain £ C M 3 with C°°-boundary 9E, the 
boundary matrix is given by 



see Example 14 In terms of the components E\\ of E parallel to the boundary surface <9£, and the 
ones E± whicn are orthogonal to it (and, hence, parallel to s) the characteristic speeds and fields 



are, 

0: E x , B x , 

±1: E ll ±sAB ll . 

Therefore, maximal dissipative boundary conditions have the form 

(£ N + s A 5||) = g(£|| - s A B||) + 5 ||, (182) 

with gu some smooth vector- valued function at the boundary which is parallel to 9S, and q a 
matrix-valued function satisfying the condition \q\ < 1. Particular cases are: 

• g = — 1, <7|| = 0: The boundary condition E\\ = describes a perfectly conducting boundary 
surface. 

• q = 0, <7|| = 0: This is a Sommerfeld-type boundary condition, which, locally, is transparent 
to outgoing plane waves traveling in the normal direction s, 

E(t,x)^£e l ^ t ~ k - X \ B(t,x) =sAE(t,x), 

where ui the frequency, k = los the wave vector, and £ the polarization vector which is 
orthogonal to k. The generalization of this boundary condition to inhomogeneous data 
mi 7^ allows to specify data on the incoming field Eu + s A B\\ at the boundary surface, 
which is equal to 2£e lLJt for the plane waves traveling in the normal inward direction — s. 

Recall that the constraints V ■ E = p and V • B = propagate along the time evolution vector field 
d t , {W-E — p) t = 0, (V--B)t = 0, provided the continuity equations holds. Since d t is tangent to the 
boundary, no additional conditions controlling the constraints must be specified at the boundary; 
the constraints are automatically satisfied everywhere provided they are satisfied on the initial 
surface. 

Example 31. Commonly, one writes Maxwell's equations as a system of wave equations for 



the electromagnetic potential A^ in the Lorentz gauge, as discussed in Example 28 By reducing 
the problem to a first order symmetric hyperbolic system, one may wonder if it is possible to 
apply the theory of maximal dissipative boundary conditions and obtain a well posed IBVP, as in 



the previous example. As we shall see in Section |5.2.1 the answer is affirmative, but the correct 



application of the theory is not completely straightforward. In order to illustrate why this is the 



G8 



case, introduce the new independent fields D^ u :— d^A u . Then, the set of wave equations can be 
rewritten as the first order system for the 20-component vector (A v , D tv , Dj V ), j = x,y,z, 

d t A v = D tv , d t D tu = d 3 D jv , d t D jv = djD tv , 

which is symmetric hyperbolic. The characteristic fields with respect to the unit outward normal 
s = ( — 1,0,0) at the boundary are 

D t v - D xv = (d t - d x )A u (incoming field) 

D tv + D xv = (d t + d x )A y (outgoing field) 

D yu = dyAy (zero speed field) 

D zv — d z A v (zero speed field) 



According to Equation (177) we can rewrite the Lorentz constraint in the following way: 



(At " Art) + (A* - Ax) - "(At + At) + (D tx + D xx ) + 2D yy + 2D ZZ . 

The problem is that when written in terms of the characteristic fields, the Lorentz constraint 
not only depends on the in- and outgoing fields, but also on the zero speed fields D yy and D zz . 
Therefore, imposing the constraint on the boundary in order to guarantee constraint preservation 
leads to a boundary condition which couples the incoming fields to outgoing and zero speed fields Y\ 
which does not fall in the class of admissaadmissibleble boundary conditions. 

At this point, one might ask why we were able to formulate a well-posed IBVP based on the 
second order formulation in Example [28j while the first order reduction discussed here fails. As we 
shall see, the reason for this is that there exist many first order reductions which are inequivalent 
to each other, and a slightly more sophisticated reduction works, while the simplest choice adopted 
here does not. See also Refs. [3401 \T2\ for well posed formulations of the IBVP in electromagnetism 
based on the potential formulation in a different gauge. 

Example 32. A generalization of Maxwell's equations is the evolution system 

d t E l3 = -e kl{i d k B l Jh (183) 

d t B i:j = +e kl{i d k E l j) , (184) 

for the symmetric, trace-free tensor fields A? and Aj, where here we use the Einstein summation 
convention, the indices i, j, k, I run over 1, 2, 3, (ij) denotes symmetrization over ij, and Sijk is the 



totally antisymmetric tensor with e 12 3 = 1. Notice that the right-hand sides of Equations (183 



184) are symmetric and trace-free, such that one can consistently assume that E t i — B% = 0. The 



evolution system (183 184) which is symmetric hyperbolic with respect to the trivial symmetrizer 
describes the propagation of the electric and magnetic parts of the Weyl tensor for linearized 
gravity on a Minkowski background, see for instance Ref. [177] , 

Decomposing Aj into its parts parallel and orthogonal to the unit outward normal s, 

Eij = E I SiSj - -7^ I + 2s (i A) + Aj, 



23 Instead of imposing the constraint itself on the boundary one might try to set some linear combination of its 
normal and time derivatives to zero, obtaining a constraint-preserving boundary condition that does not involve zero 
speed fields. Unfortunately, this trick only seems to work for reflecting boundaries, see Refs. [3931 and [103| for the 
case of general relativity. In our example, such boundary conditions are given by dt At = d x A x = dtA y = dtA z = 
which imply dt(d v A v ) = 0. 
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where 7^ := 5 io - SiSj, E := s i S i E ij , E { ■- "/ i k E kj s j , E %1 := (7^7] - -f. l:j -f kl /2)E ku and similarly 
for Bij, the eigenvalue problem Au = Pq(s)u for the boundary matrix is 



XE -- 


-- 0, 


XB -- 


= 0, 


XEi -- 




XB, -- 


= + \e kl ^E\ 


XEij - 


- —Ekl(iS B j) 


XBij - 


= +Ekl(iS E j-j 



from which one obtains the following characteristic speeds and fields, 

: E, B, 

±~: E tT e kHS k B l , 
±1: E lj T£ki( i s k B l j) . 

Similarly to the Maxwell case, the boundary condition E^ — Ski(iS k B l j- j = on the incoming, 
symmetric trace- free characteristic field is, locally, transparent to outgoing linear gravitational 
plane waves traveling in the normal direction s. In fact, this condition is equivalent to setting to 
zero at the boundary surface the complex Weyl scalar ^q, computed from the adapted, complex 
null tetrad K :— dt + S, L := dt — s, Q, QJ^j Variants of this condition have been proposed in 
the literature in the context of the IBVP for Einstein's field equations in order to approximately 
control the incoming gravitational radiation, see Refs. [T5TI 1551 I2T71 I5B51 I5CT1 I5UTI 12751 15721 1553] . 
However, one also needs to control the incoming field Ei — Ekus k B l at the boundary. This 
field, which propagates with speed 1/2, is related to the constraints in the theory. Like in elec- 
tromagnetism, the fields E^ and Bij are subject to the divergence constraints Pj := d l Eij = 0, 
Qj := d i Bij = 0. Unlike the Maxwell case, however, these constraints do not propagate trivially. 



As a consequence of the evolution equations (183 184), the constraint fields Pj and Qj obey 



d t P 3 = ~\e ]k id k Q l , d t Qj = +^e ]kl d k P\ 

which is equivalent to Maxwell's equations except that the propagation speed for the transverse 
modes is 1/2 instead of 1. Therefore, guaranteeing constraint propagation requires specifying 
homogeneous maximal dissipative boundary conditions for this system, which have the form of 



Equation (182 1 with E h-> P, B h> — Q and <?|| = 0. A problem is that this yields conditions 
involving first derivatives of the fields Eij and Bij , when rewritten as a boundary condition for the 



main system (183 184 1. Except in some particular cases involving totally reflecting boundaries, it 
is not possible to cast these conditions into maximal dissipative form. 

A solution to this problem has been presented in Refs. [176j and [181] . where a similar system 
appears in the context of the IBVP for Einstein's field equations for solutions with anti-de Sitter 
asymptotics, or for solutions with an artificial boundary, respectively. The method consists in 



modifying the evolution system ( 183 184 ) by using the constraint equations Pj = Qj = in such 



a way that the constraint fields for the resulting boundary adapted system propagate along dt at 



24 This relation is explicitly given in terms of the Weyl tensor C, namely ^o = C a p~ l $K a QP K~<Q 6 = 2(E a p — 
£~fS(a sl B fi))Q a Q > where E a p = Ca^pgT^T and B a p = *C al pgT^T are the electric and magnetic parts of C 
with respect to the timelike vector field T = dt- 
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the boundary surface. In order to describe this system, extend s to a smooth vector field on E 
with the property that \s\ < 1. Then, the boundary adapted system reads: 



9tB„ = 



£fci(i<9 B j) + S(iSj) k iS Q , 



k t?1 



-£u(id E' 



'(i fc i) 



k r>l 



£j)klS P' 



(185) 
(186) 



This system is symmetric hyperbolic, and the characteristic fields in the normal direction are iden- 
tical to the unmodified system with the important difference that the fields En =p ekus k B l now 
propagate with zero speed. The induced evolution system for the constraint fields is symmetric 
hyperbolic, and has a trivial boundary matrix. As a consequence, the constraints propagate tan- 
gentially to the boundary surface, and no extra boundary conditions for controlling the constraints 
must be specified. 



5.2.1 Application to systems of wave equations 

As anticipated in Example |31[ the theory of symmetric hyperbolic first order equations with 
maximal dissipative boundary conditions can also be used to formulate well posed IBVP for systems 
of wave equations which are coupled through the boundary conditions, as already discussed in 
Section [5. 1.3| based on the Laplace method. Again, the key idea is to show strong well posedness; 
that is, an a priori estimate which controls the first derivatives of the fields in the bulk and at the 
boundary. 

In order to explain how this is performed, we consider the simple case of the Klein-Gordon 
equation $ tt = A$ — m 2 $ on the half plane S:={(i,!/)el 2 :i>0}. In Example 13 we reduced 
the problem to a first order symmetric hyperbolic system for the variables u = ($, $t, $> x , <fr y ) 



with symmetrizer H = diag(m , 1, 1, 1), and in Example 29 we determined the class of maximal 
dissipative boundary conditions for this first order reduction. Consider the particular case of 
Sommerfeld boundary conditions, where <I> t = <fr x is specified at x = 0. Then, Equation (52 1 gives 
the following conservation law, 



£(E T ) = E(Z ) +11 u*HP (s)u\ x=0 dydt, 



(187) 



where E(E t ) = f^ t U*Hudxdy = / Et (m 2 |$| 2 + |$ t | 2 + |$ x | 2 + |$ y | 2 ) dxdy, and u*HP (s)u = 
— 2Re($j $a;), see Example [29| Using the Sommerfeld boundary condition, we may rewrite — 2Re($^ $ x ) 
— (\®t\ 2 + \®x\ 2 ), and obtain the energy equality 



i 
E(E T ) + J J [l**| 2 + \$ x \ 2 ] x=0 dydt = £(E ) 



controlling the derivatives of <&t and <fr x at the boundary surface. However, a weakness of this 
estimate is that it does not control the zero speed fields $ and $j, at the boundary, and so one 
does not obtain strong well posedness. 

On the other hand, the first order reduction is not unique, and as we show now, different 
reductions may lead to stronger estimates. For this, we choose a real constant b such that < 
b < 1/2 and define the new fields u :— ($, $t — b$ x , & x , $ y ), which yield the symmetric hyperbolic 
system 



u t 



( b 





\ 




/ \ 
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-b 2 
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with symmetrizer H = diag(m 2 , 1,1 — 6 2 , 1). The characteristic fields in terms of $ and its deriva- 
tives are $, $ y , $t + $> x , and $* — 3>a,, as before. However, the fields now have characteristic speeds 
—6, —6, — 1, +1, respectively, whereas in the previous reduction they were 0, 0, —1, +1. Therefore, 
the effect of the new reduction versus the old one is to shift the speeds of the zero speed fields, and 
to convert them to outgoing fields with speed —6. Notice that the Sommerfeld boundary condition 
$ t = $2 is still maximal dissipative with respect to the new reduction. Repeating the energy 
estimates again leads to a conservation law of the form ( |187[ ), but where now the energy and flux 
quantities are £(£ t ) = J E u*Hudxdy = / E (m 2 |$| 2 + |$ t - b$ x \ 2 + (1 - 6 2 )|$ x | 2 + |*y| 2 ) dxdy 
and 

u*HP (s)u = -b [m 2 \<5>\ 2 + |$ t | 2 + |$ x | 2 + |$,| 2 ] + 26 [|$ t | 2 + I*.! 2 ] - 2Re($£$ iC ). 

Imposing the boundary condition $ 4 = $ x at a; = and using 26 < 1 leads to the energy estimate 

T 

£(£ T ) + bj y [m 2 |$| 2 + |$ t | 2 + |* 8 | a + |* v | a ] x=0 dydt < £(Eo), 

R 

controlling $ and all its first derivatives at the boundary surface. 

Summarizing, we have seen that the most straightforward first order reduction of the Klein- 
Gordon equation does not lead to strong well-posedness. However, strong well-posedness can be 
obtained by choosing a more sophisticated reduction, in which the time-derivative of $ is replaced 
by its derivative $t — 6$ x along the time-like vector (1,-6) which is pointing outside the domain 
at the boundary surface. In fact, it is possible to obtain a symmetric hyperbolic reduction leading 
to strong well posedness for any future-directed time-like vector field u which is pointing outside 
the domain at the boundary. Based on the geometric definition of first order symmetric hyperbolic 
systems in |199j , it is possible to generalize this result to systems of quasilinear wave equations on 
curved backgrounds |257j . 

In order to describe the result in [257], let 7r : E — > M be a vector bundle over M — [0, T] x £ 
with fibre M. N , let V M be a fixed, given connection on E and let g M „ = <7 MI ,(<I>) be a Lorentz metric 
on M with inverse g^ v (<fr) which depends pointwise and smoothly on a vector- valued function 
$ = {$ a }a=i,2....,n 7 parameterizing a local section of E. Assume that each time-slice £j = {£} x £ 
is space- like and that the boundary T = [0, T] x <9£ is time-like with respect to g r /iV ($). We consider 
a system of quasilinear wave equations of the form 

ff ^($)V M V„$ A = F A ($, V<1>), (188) 

where F ($, V$) is a vector-valued function which depends pointwise and smoothly on its argu- 



ments. The wave system ( 188 1 is subject to the initial conditions 



$ A | Eo =<, r/V^| Eo =ltf, (189) 

where $ A and n^ are given vector- valued functions on £ , and where n M = n M ($) denotes the 
future-directed unit normal to £ with respect to <7 M „. In order to describe the boundary conditions, 
let T M = T ti (p, $), p <E 7", be a future-directed vector field on T which is normalized with respect 
to <7 M „ and let A"' = N^(p, $) be the unit outward normal to T with respect to the metric g^ v . 
We consider boundary conditions on T of the following form 

fT" + aN»] V M $ A | r = <f A B V M $ B | r + d A B $ B | r + G A , (190) 

where a — a{p 1 $) > is a strictly positive, smooth function, G A = G A (p) is a given, vector-valued 
function on T and the matrix coefficients c^ A b — c^ a b(p,^) and d A B = d A B(p,&) are smooth 
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functions of their arguments. Furthermore, we assume that c M g satisfies the following property. 



Given a local trivialization ip : U x 



sJV 



I— > 7T 



(U) of J5 such that U C M is compact and contains a 



portion W of the boundary T, and given e > 0, there exists a smooth map J : U —> GL(N. i),pH4 
(J b(p)) such that the transformed matrix coefficients 



C» A B : = 



J A cc 



^dIJ-Yb 



are in upper triangular form with zeroes on the diagonal, that is 



C» A B 



0. 



B < A. 



Theorem 8. ggffi The IBVP \188\\189\\190\ is well posed. Given T > and sufficiently small and 
smooth initial and boundary data $^, IIq and G satisfying the compatibility conditions at the edge 
S = {0} x dYi, there exists a unique smooth solution on M satisfying the evolution equation \188 ), 
the initial condition (189) and the boundary condition (190). Furthermore, the solution depends 
continuously on the initial and boundary data. 

Theorem [8] provides the general framework for treating wave systems with constraints, such 
as Maxwell's equations in the Lorentz gauge and, as we will see in Section 6.1 Einstein's field 
equations with artificial outer boundaries. 



5.2.2 Existence of weak solutions and the adjoint problem 

Here, we show how to prove the existence of weak solutions for linear, symmetric hyperbolic 
equations with variable coefficients and maximal dissipative boundary conditions. The method 
can also be applied to a more general class of linear symmetric operators with maximal dissipative 
boundary conditions, see Refs. [1831 1267] , The proof below will shed some light on the maximality 
condition for the boundary space V p . 



Our starting point is an IBVP of the form ( 129 130 



131 1 where the matrix functions A° {t, x) 
and b(t, x) do not depend on u, and where F(t, x, u) is replaced by B{t, x)u + F(t, x), such that the 
system is linear. Furthermore, we can assume that the initial and boundary data is trivial, / = 0, 
g = 0. We require the system to be symmetric hyperbolic with symmetrizer H{t, x) satisfying the 
conditions in Definition BTiii) , and assume the boundary conditions (131) are maximal dissipative. 
We rewrite the IBVP on VLt := [0, T] x S as the abstract linear problem 



Lu = F, 



(191) 



where L : D(L) C X — > X is the linear operator on the Hilbert space X := L 2 (VLt) defined by the 
evolution equation and the initial and boundary conditions: 

D{L) := {u e C^°(fi T ) : u(p) = for all peE and u(p) e V p for all peT}, 
du 



Lu 



p=0 



x w + B(f ' x)u ' 



ueD(L), 



where we have defined A := —I and x° := t, where V p = {u € C m : b(t, x)u = 0} is the boundary 
space, and where S := {0} x S, St := {T} x S and T := [0,T] x 9S denote the initial, the final 
and the boundary surface, respectively. 

For the following, the adjoint IBVP plays an important role. This problem is defined as follows. 
First, the symmetrizer defines a natural scalar product on X, 



(v,u)h'= / v*(t,x)H(t,x)u(t,x) dtd n x, 



u. v £ X, 
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which, because of the properties of H , is equivalent to the standard scalar product on L 2 (Qt). In 
order to obtain the adjoint problem, we take u E D(L) and v E C^°(Ht), and use Gauss's theorem 
to find 

(v,Lu)„ = (L*v,u) H + fv*Hud n x- { v*Hud n x+ f v*HP () {t,x,s)udS, (192) 

where we have defined the formal adjoint L* : D(L*) c X — > X of L by 

L*v := - J2 ^(t, x)pj- H & xT 1 £ d{H{t,X ^ {t ' X)) v + H(t, xr'Bit, x)*H(t, x)v. 
y.=0 11=0 



In order for the integrals on the right-hand side of Equation ( 192 1 to vanish, such that (v, Lu)h = 
(L*v,u)h, we first notice that the integral over E vanishes, because u — on E . The integral 
over Tit also vanishes if we require v = on Sy. The last term also vanishes if we require v to lie 
in the dual boundary space 

V* := {veC m : v*HP (t, x, s)u = for all u E V p }. 

for each p € T '. Therefore, if we define 

D(L*) := {v e C^(n T ) : v(p) = for all peE T and v(p) £ V p * for all p E T}, 

we have (v,Lu)h = (L*v,u)h for all u E D(L) and v E D(L*), that is, the operator L* is adjoint 
to L. There is the following nice relation between the boundary spaces V p and V* : 

Lemma 4. Let p E T be a boundary point. Then, V p is maximal nonpositive if and only if V* is 
maximal nonnegative. 

Proof. Fix a boundary point p — (t,x) E T and define the matrix B := H(t,x)Po(t,x,s) with s 
the unit outward normal to <9£ at x. Since the system is symmetric hyperbolic, B is Hermitian. 
We decompose C m = E + (BE_ ®E into orthogonal subspaces E + , E-, Eq on which B is positive, 
negative and zero, respectively. We equip E± with the scalar products (•, -)± which are defined by 

(u±,v±)±:=±u±Bv±, u±,v ± E E ± . 

In particular, we have u*Bu — (u+, u + ) + — (u_, u_)_ for all u E C m . Therefore, if V p is maximal 
nonpositive, there exists a linear transformation q : E- — > E + satisfying |qw_| + < |u_|_ for all 
E E- such that (cf. Equation ( |181[ )) 



u 



± 



V p = {uEC m :u + = qu-}. 

Let veV*. Then, 

= v*Bu= (v + ,u + ) + - (v_,u_)_ = (v + ,qu_) + - (u_,u_)_ = (g f u + ,u_)_ - (v_,u_)_ 

for all u E V p , where q< : E + —t E- is the adjoint of q with respect to the scalar products (•, •) 
defined on E±. Therefore, u_ = q'v+, and 

V* ={vE C m : v- = q^v+}. 

Since q^ has the same norm as q, which is one, it follows that V* is maximal nonnegative. The 
converse statement follows in an analogous way. □ 

The lemma implies that solving the original problem — Lu = F with u E D(L) is equivalent to 
solving the adjoint problem L*v = F with v E D(L*), which, since v(T, x) — is held fixed at Ex, 
corresponds to the time-reversed problem with the adjoint boundary conditions. From the energy 
a priori estimates we obtain: 
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Lemma 5. There is a constant 5 = 8(T) such that 

\\Lu\\h>5\\u\\ h , \\L*v\\ h > 6\\v\\ H (193) 

for all u £ D(L) and v £ D(L*), where \\ ■ \\h is the norm induced by the scalar product (•, -)h ■ 



Proof. Let u £ D{L) and set F := — Lu. From the energy estimates in Section 3.2.3 one easily 
obtains 

E(Z t ) < C\\Ff H , 0<t<T, 

for some positive constants C depending on T. Integrating both sides from t = to t = T gives 

\\u\\ 2 H <CT\\F\\ 2 H = CT\\Lu\\ 2 H , 

which yields the statement for L setting 8 := (CT) -1 / 2 . The estimate for L* follows from a 
similar energy estimate for the adjoint problem, or it follows from the established result for L 
using (v,Lu)h = (L*v,u)h for all u £ D(L) and v £ D(L*). □ 

In particular, Lemma p^ implies that (strong) solutions to the IBVP and its adjoint are unique. 
Since L and L* are closable operators |334] . their closures L and L* satisfy the same inequalities 



as in Equation (193). Now we are ready to define weak solutions and to prove their existence: 



Definition 10. u £ X is called a weak solution of the problem (191) if 

(L*v,u) H = -(v,F) H 

for allv £ D(L*). 

In order to prove the existence of such u £ X, we introduce the linear space Y = D(L*) and 
equip it with the scalar product (•, -)y defined by 

(v, w)y ■= (L*v, L*w)h, v,w£Y. 

The positivity of this product is a direct consequence of Lemma|5j and since L* is closed, Y defines 
a Hilbert space. Next, we define the linear form J : Y — >• C on Y by 

J(v):=-(F,v) H . 

This form is bounded, according to LemmajHJ | J(v) | < ||F|| ff ||v|| H < S^WFWhWL^vWh = S^WFWhWvWy 
for all v £ Y . Therefore, according to the Riesz representation lemma there exists a unique w £ Y 
such that (w, v)y — J(v) for all v £ Y. Setting u := L*w £ X gives a weak solution of the problem. 
If u £ X is a weak solution which is sufficiently smooth, it follows from the Green type iden- 



tity ( 192 ) that u has vanishing initial data and that it satisfies the required boundary conditions, 



and hence is a solution to the original IBVP ( 191 ). The difficult part is to show that a weak solu- 



tion is indeed sufficiently regular for this conclusion to be made. See Refs. }183[ 1267 , 333, 332. 1375] 
for such "weak=strong" results. 

5.3 Absorbing boundary conditions 

When modeling isolated systems, the boundary conditions have to be chosen such that they min- 
imize spurious reflections from the boundary surface. This means that inside the computational 
domain, the solution of the IBVP should lie as close as possible to the true solution of the Cauchy 
problem on the unbounded domain. In this sense, the dynamics outside the computational domain 
is replaced by appropriate conditions on a finite, artificial boundary. Clearly, this can only work 
in particular situations, where the solutions outside the domain are sufficiently simple so that they 
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can be computed and used to construct boundary conditions which are, at least, approximately 
compatible with them. Boundary conditions which give rise to a well-posed IBVP and achieve this 
goal are called absorbing, non-reflecting or radiation boundary conditions in the literature, 
and there has been a substantial amount of work on the construction of such conditions for wave 
problems in acoustics, electromagnetism, meteorology, and solid geophysics (see [200] for a review). 
Some recent applications to General Relativity are mentioned in Sections [6] and |10.3.1| 

One approach in the construction of absorbing boundary conditions is based on suitable series 
or Fourier expansions of the solution, and derives a hierarchy of local boundary conditions with 
increasing order of accuracy [E9, 45, 235.. Typically, such higher order local boundary conditions 
involve solving differential equations at the boundary surface, where the order of the differential 
equation is increasing with the order of the accuracy. This problem can be dealt with by introducing 
auxiliary variables at the boundary surface |20U I202J . 

The starting point for a slightly different approach is an exact nonlocal boundary condition 
which involves the convolution with an appropriate integral kernel. A method based on an efficient 
approximation of this integral kernel is then implemented, see for instance Ref. (141 115] for the 
case of the 2D and 3D flat wave equations and Refs. |263| I262[ 1264] for the Regge- Wheeler [336 
and Zerilli |440] equations describing linear gravitational waves on a Schwarzschild background. 
Although this method is robust, very accurate and stable, it is based on detailed knowledge of the 
solutions which might not always be available in more general situations. 

In the following, we illustrate some aspects of the problem of constructing absorbing boundary 
conditions on some simple examples |360j . Specifically, we construct local absorbing boundary 
conditions for the wave equation with a spherical outer boundary at radius R > 0. 

5.3.1 The one-dimensional wave equation 

Consider first the one-dimensional case, 

uu ~ u xx = 0, \x\ < R, t > 0. (194) 

The general solution is a superposition of a left- and a right-moving solution, 

u(t,x) = f^{x + t) + fXx-t). 
Therefore, the boundary conditions 

(b-u)(t,-R)=0, (b+u)(t,+R) = Q, b ± :=-±— , t>0, (195) 

are perfectly absorbing according to our terminology. Indeed, the operator & + has as its kernel 
the right-moving solutions f/<(x — t), hence the boundary condition (b+u)(t, R) — at x = R is 
transparent to these solutions. On the other hand, b+f\(t + x) = 2fi(t + x), which implies that 
at x = R, the boundary condition requires that f<\(v) = j/\,(l) is constant for advanced time 
v = t + x > R. A similar argument shows that the left boundary condition (b_u)(t,—R) = 
implies that f/*{—u) = fy(—R) is constant for retarded time u = t — x > R. Together with 
initial conditions for u and its time derivative at t = satisfying the compatibility conditions, 
Equations (|194 195 ) give rise to a well-posed IBVP. In particular, the solution is identically zero 



after one crossing time t > 2R for initial data which are compactly supported inside the interval 
(-R,R). 

5.3.2 The three-dimensional wave equation 

Generalizing the previous example to higher dimensions is a nontrivial task. This is due to the fact 
that there are infinitely many propagation directions for outgoing waves, and not just two as in 
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the one-dimensional case. Ideally, one would like to control all the propagation directions k which 
are outgoing at the boundary (k ■ n > 0, where n is the unit outward normal to the boundary), 
but this is obviously difficult. Instead, one can try to control specific directions (starting with the 
one that is normal to the outer boundary). Here, we illustrate the method of Ref. [JS] on the 
three-dimensional wave equation, 



Un - Au = 0, 



Id <R, t>0. 



(196) 



The general solution can be decomposed into spherical harmonics Y im according to 

oo e 

£=0m=-£ 

which yields the family of reduced equations 

ui m (t,r) = 0, 



d?_ <?_ 1(1+1) 
dt 2 dr 2 r 2 



< r < R, t>0. 



(197) 



For £ = this equation reduces to the one-dimensional wave equation, for which the general 
solution is uoo(t,r) = £/ooy( r — + ^oo\(^ + t) with I/oo/ 1 and Uqo\ two arbitrary functions. 
Therefore, the boundary condition 



£o: b(ru)\ r=R = 0, 



b:= 



d_ d_ 
dt dr 



t >0, 



(198) 



is perfectly absorbing for spherical waves. For t > 1, exact solutions can be generated from the 
solutions for £ = by applying suitable differential operators to u o(t 7 r). For this, we define the 
operators 89J 



a, 



which satisfy the operator identities 



d_ 
dr 



a i+\ a i + i — a £ a e 



As a consequence, for each £=1,2,3,..., we have 



4 = 


d £ 
dr r 


d 2 


£{£+!) 


dr 2 


r 2 



(¥_ (P_ 1(1 + 1) 

dt 2 dr 2 r 2 



a\a\ 



= 


\9 2 t 1 


a £ a £-1 


...a\ 


= 4 


\d 2 t 


4-i 


= c 


i\a 


f ft f 

£-1 ■ ■ ■ "l 


' d 2 

dt 2 


d 2 ' 

dr 2 



Therefore, we have the explicit in- and outgoing solutions 

u im ^(t,r) = a\4_ x . . . a\V lm (r + t)= ]T(-1) J ^"^ (2r)^V^(r + t), 



j=o 



U£ m /< 



(t,r) = aWe-i ■ ■ ■ a{U lm (r -t)= ^(-1)' j?_~$\ (2rf ~^ (r-t), (199) 

j=o { J) - J - 
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where Vz m and Ug m are arbitrary smooth functions with j'th derivatives V^j and U^ , respectively. 
In order to construct boundary conditions which are perfectly absorbing for U£ m7 one first notices 
the following identity: 

^ +1 aloJ_ 1 . . . a[U(r - t) = (200) 

for all i = 0, 1 ,2, . .. and all sufficiently smooth functions U. This identity follows easily from 



Equation ([199]) and the fact that b e+1 (r k ) = k(k + l)---(k+£)r k+e+1 =0 if fee {0,-1,-2,. .. ,-£}. 
Therefore, given L € {1, 2,3,.. .}, the boundary condition 

B L : b L+1 (ru)\ r=R = 0, (201) 

leaves the outgoing solutions with £ < L unaltered. Notice that this condition is local in the 
sense that its formulation does not require the decomposition of u into spherical harmonics. Based 
on the Laplace method, it was proven in J3S] (see also Ref. |357] ) that each boundary condition 
Bl yields a well posed IBVP. By uniqueness this implies that initial data corresponding to a 
purely outgoing solution with £ < L yields a purely outgoing solution (without reflections). In 
this sense, the condition Bl is perfectly absorbing for waves with £ < L. For waves with £ > L, 
one obtains spurious reflections; however, for monochromatic radiation with wave number k, the 
corresponding amplitude reflection coefficients can be calculated to decay as (fci?)~ 2 ( L+1 ) in the 
wave zone kR ^$> 1 [85j . Furthermore, in most scenarios with smooth solutions, the amplitudes 
corresponding to the lower few £'s will dominate over the ones with high £ so that reflections 
from high ts are unimportant. For a numerical implementation of the boundary condition Bi via 
spectral methods and a possible application to General Relativity see Ref. [306J. 

5.3.3 The wave equation on a curved background 

When the background is curved, it is not always possible to construct in- and outgoing solutions 
explicitly, as in the previous example. Therefore, it is not even clear how a hierarchy of absorbing 
boundary conditions should be formulated. However, in many applications the spacetime is asymp- 
totically flat, and if the boundary surface is placed sufficiently far from the strong field region, one 
can assume that the metric is a small deformation of the flat, Minkowski metric. To first order in 
M/R with M the ADM mass and R the areal radius of the outer boundary, these correction terms 
are given by those of the Schwarzschild metric, and approximate in- and outgoing solutions for all 
(I, m) modes can again be computed [360J rj The M/R terms in the background metric induce 
two kind of corrections in the in- and outgoing solutions ui m \^ . The first is a curvature correction 



term which just adds M/R terms to the coefficients in the sum of Equation (199 1. This term is 
local and still obeys Huygens' principle. The second term is fast decaying (it decays as R/r e+1 ) 
and describes the backscatter off the curvature of the background. As a consequence, it is nonlocal 
(it depends on the past history of the unperturbed solution) and violates Huygens' principle. 

By construction, the boundary conditions Bl are perfectly absorbing for outgoing waves with 
angular momentum number £ < L, including their curvature corrections to first order in M/R. If 
the first order correction terms responsible for the backscatter are taken into account, then Bl are 
not perfectly absorbing anymore, but the spurious reflections arising from these correction terms 
have been estimated in |360| to decay at least as fast as (M/R)(kR)~ 2 for monochromatic waves 
with wave number k satisfying M <C fc _1 <C R. 

The well-posedness of higher order absorbing boundary conditions for wave equations on a 
curved background can be established by assuming the localization principle and the Laplace 



method [357J . Some applications to General Relativity are discussed in Sections [6] and 10.3.1 



25 A systematic derivation of exact solutions including the correction terms in M/R of arbitrarily high order in the 
Schwarzschild metric was given in Ref. [39] in a different context. However, a generic asymptotically flat spacetime 
is not expected to be Schwarzschild beyond the order of M/R, since higher-correction terms involve, for example, 
the total angular momentum. See Ref. 11321 for the behavior of linearized gravitational perturbations of the Kerr 
spacetime near future null infinity. 
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6 Boundary Conditions for Einstein's Equations 

The subject of this section is the discussion of the initial-boundary value problem (IBVP) for Ein- 
stein's field equations. There are, at least, three difficulties when formulating Einstein's equations 
on a finite domain with artificial outer boundaries. First, as we have seen in Section [4l the evolu- 
tion equations are subject to constraints which in general propagate with nontrivial characteristic 
speeds. As a consequence, in general there are incoming constraint fields at the boundary that 
need to be controlled in order to make sure that the constraints propagate correctly, i.e., that 
constraint satisfying initial data yields a solution of the evolution equations and the constraints 
on the complete computational domain, and not just on its domain of dependence. The control 
of these incoming constraint fields leads to constraint-preserving boundary conditions, and a non- 
trivial task is to fit these conditions into one of the admissible boundary conditions discussed in 
the previous section, for which well posedness can be shown. 

A second issue is the construction of absorbing boundary conditions. Unlike the simple examples 
considered in Section |5.3| for which the fields evolve on a fixed background and in- and outgoing 
solutions can be represented explicitly, or at least characterized precisely, in General Relativity it 
is not even clear how to define in- and outgoing gravitational radiation since there are no local 
expressions for the gravitational energy density and flux. Therefore, the best one can hope for is to 
construct boundary conditions which approximately control the incoming gravitational radiation in 
certain regimes, like, for example, in the weak field limit where the field equations can be linearized 
around, say, a Schwarzschild or Minkowski spacetime. 

Finally, the third issue is related to the diffeomorphism invariance of the theory. Ideally, one 
would like to formulate a geometric version of the IBVP, for which the data given on the initial 
and boundary surfaces S and T can be characterized in terms of geometric quantities such as the 
first and second fundamental forms of these surfaces as embedded in the yet unknown spacetime 
(M,g). In particular, this means that one should be able to identify equivalent data sets, i.e., those 
which are related to each other by a diffeomorphism of M leaving So and T invariant, by local 
transformations on So and T, without knowing the solution (M,g). It is currently not even clear 
if such a geometric uniqueness property does exist, see Refs. [1801 I34fj for further discussions on 
these points. 

A well posed IBVP for Einstein's vacuum field equations was first formulated by Friedrich and 
Nagy |181j based on a tetrad formalism which incorporates the Weyl curvature tensor as an inde- 
pendent field. This formulation exploits the freedom of choosing local coordinates and the tetrad 
orientation in order to impose very precise gauge conditions which are adapted to the boundary 
surface T and tailored to the IBVP. These gauge conditions, together with a suitable modification 



of the evolution equations for the Weyl curvature tensor using the constraints (cf. Example 32 1 , 
lead to a first order symmetric hyperbolic system in which all the constraint fields propagate tan- 
gentially to T at the boundary. As a consequence, no constraint-preserving boundary conditions 
need to be specified, and the only incoming fields are related to the gravitational radiation, at least 
in the context of the approximations mentioned above. With this, the problem can be shown to 
be well posed using the techniques described in Section [572] 

After the pioneering work of Ref. |181j , there has been much effort in formulating a well posed 
IBVP for metric formulations of general relativity, on which most numerical calculations are based. 
However, with the exception of particular cases in spherical symmetry [244] . the linearized field 
equations [301] or the restriction to flat, totally reflecting boundaries |59"2ll59"3l HD51 l9"51 12191 I2T51 
13981 1271 [T5] , not much progress had been made towards obtaining a manifestly well posed IBVP 
with nonreflccting, constraint-preserving boundary conditions. The difficulties encountered were 
similar to those described in Examples [3l] and [32J Namely, controlling the incoming constraint 
fields usually resulted in boundary conditions for the main system involving either derivatives of 
its characteristic fields or fields propagating with zero speed, when it was written in first order 
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symmetric hyperbolic form. Therefore, the theory of maximal dissipative boundary conditions 
could not be applied in these attempts. Instead, boundary conditions controlling the incoming 
characteristic constraint helds were specified and combined with more or less ad hoc conditions 
controlling the gauge and gravitational degrees of freedom and verified to satisfy the Lopatinsky 



condition |144j) using the Laplace method, see Refs. 383, 1051 13661 12181 1351 ] 1356 ] , 

The breakthrough in the metric case came with the work by Kreiss and Winicour [260 who 
formulated a well-posed IBVP for the linearized Einstein vacuum field equations with harmonic 
coordinates. Their method is based on the pseudo-differential first order reduction of the wave 
equation described in Section [5. 1.3| which, when combined with Sommerfeld boundary conditions, 
yields a problem which is strongly well posed in the generalized sense and, when applied to systems 
of equations, allows a certain hierarchical coupling in the boundary conditions. This work was then 
generalized to shifted wave equations and higher order absorbing boundary conditions in |357) . 
Later, it was recognized that the results in |260j could also be established based on the usual 
a priori energy estimates based on integration by parts |256j . Finally, it was found that the 
boundary conditions imposed were actually maximal dissipative for a specific nonstandard class 
of first order symmetric hyperbolic reduction of the wave system, see Section |5.2.1| Unlike the 
reductions considered in earlier work, such nonstandard class has the property that the boundary 
surface is noncharacteristic, which implies that no zero speed fields are present, and yields a strong 
well posed system. Based on this reduction and the theory of quasilinear symmetric hyperbolic 
formulations with maximal dissipative boundary conditions [2121 1376] , it was possible to extend 
the results in [260, 256 and formulate a well-posed IBVP for quasilinear systems of wave equations 
|257j with a certain class of boundary conditions (see Theorem [8| which was sufficiently flexible to 
treat the Einstein equations. Furthermore, the new reduction also offers the interesting possibility 
to extend the proof to the discretized case using finite difference operators satisfying the summation 



by parts property, discussed in Sections 8.3 and |9.4| 

In order to parallel the presentation in Section |1J here we focus on the IBVP for Einstein's 
equations in generalized harmonic coordinates and the IBVP for the BSSN system. The first case, 
which is discussed in Section [6~T| is an application of Theorem [8] In the BSSN case, only partial 
results have been obtained so far, but since the BSSN system is widely used, we nevertheless present 



some of these results in Section 6.2 In Section 6.3 we discuss some of the problems encountered 



when trying to formulate a geometric uniqueness theorem and, finally, in Section 6.4 we briefly 
mention alternative approaches to the IBVP which do not require an artificial boundary. 

For an alternative approach to treating the IBVP which is based on the imposition of the 
Gauss-Codazzi equations at T, see Refs. [1851 11861 I188U187] . For numerical studies, see Refs. [2441 
[TO2[MllMllMllMl[iia[2MI3Ml[M3[S![2Sni|^ especially Refs. [551] and [357] 

for a comparison between different boundary conditions used in numerical relativity and Ref. [353 
for a numerical implementation of higher absorbing boundary conditions. For review articles on 
the IBVP in general relativity see Refs. [31)01 IMTl WZ5\ . 

At present, there are no numerical simulations which are based directly on the well posed IBVP 
for the tetrad formulation [181 or the well posed IBVP for the harmonic formulation 260, 256, 257 



described in Section 6.1 nor is there a numerical implementation of the constraint-preserving 



boundary conditions for the BSSN system presented in Section 6.2 The closest example is the 
harmonic approach described in [278[ [351, 354J which has been shown to be well posed in the 
generalized sense in the high frequency limit [357] . However, as mentioned above, the well posed 
IBVP in [257J opens the door for a numerical discretization based on the energy method which can 
be proven to be stable, at least in the linearized case. 
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6.1 The harmonic formulation 



Here, we discuss the IBVP formulated in Ref. [257J for the Einstein vacuum equations in generalized 
harmonic coordinates. The starting point is a manifold of the form M = [0, T] x S, with £ a three- 
dimensional compact manifold with C°°-boundary <9£, and a given, fixed smooth background 
metric g a a with corresponding Levi-Civita connection V, as in Section 4.1 We assume that the 
time slices Sj := {<} x S are space-like and that the boundary surface T := 0, T] x <9£ is time-like 
with respect to g a g. 

In order to formulate the boundary conditions, we first construct a null tetrad {if, L^jQ^, Q M } 
which is adapted to the boundary. This null tetrad is based on the choice of a future-directed 
time-like vector field T^ tangent to T which is normalized such that g^JT^T" = — 1. One possible 
choice is to tie T M to the foliation S t , and the define it in the direction orthogonal to the cross 
sections dT, t = {£} x OH of the boundary surface. A more geometric choice has been proposed 
in [180] , where instead T M is chosen as a distinguished future-directed time-like eigenvector of the 
second fundamental form of T, as embedded in (M,g). Next, we denote by iV M the unit outward 
normal to T with respect to the metric g^ and complete T M and N^ to an orthonormal basis 
{T^, A"', V**, W^} of T p M at each point p e T. Then, we define the complex null tetrad by 



if :=T^ + A^, 



L M ;=T m_ N » 



Q" 



F + iF, 



V-iW*, 



where i = y— T. Notice that the construction of these vectors is implicit, since it depends on the 
dynamical metric g a p which is yet unknown. However, the dependency is algebraic, and does not 
involve any derivatives of g a p . We also note that the complex null vector Q 11 is not unique since 
it can be rotated by an angle ip € R, Q M i-> e %(p Q^, Finally, we define a radial function r on T as 
the areal radius of the cross sections dY, t with respect to the background metric. 



Then, the boundary conditions which were proposed in |257j for the harmonic system ( 75 ) are: 
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where Vr-/i 



K'l'LQ 



K^L a ( 



V M /i Q £, h 



KL 



K a L^h a/ s, Hk := K^H^, etc., and where qx and q^ 



are real-valued given smooth functions on T and qQ and qQQ are complex-valued given smooth 
functions on T. Since Q is complex, these constitute ten real boundary conditions for the metric 
coefficients h a p . The content of the boundary conditions ( 202 , 203 204 205 ) can be clarified by 



considering linearized gravitational waves on a Minkowski background with a spherical boundary. 
The analysis in [25 7j shows that in this context the four real conditions ( 202 ) , ( 203 204 ) are related 



to the gauge freedom; and the two conditions (205 1 control the gravitational radiation. The remain- 



ing conditions ( 206 207 208 ) enforce the constraint C M = on the boundary, see Equation ( 76 1 



and so together with the constraint propagation system ( 83 ) and the initial constraints ( 84 ) they 
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guarantee that the constraints are correctly propagated. Based on these observations, it is expected 
that these boundary conditions yield small spurious reflections in the case of a nearly spherical 
boundary in the wave zone of an asymptotically flat curved spacetime. 



6.1.1 Well posedness of the IBVP 



The IBVP consisting of the harmonic Einstein equations (75), initial data (77) and the boundary 
conditions (202 208) can be shown to be well posed as an application of Theore m [8] For this, we 
first notice that the evolution equations (75 1 have the required form of Equation ( 188 1, where E is 



the vector bundle of symmetric, covariant tensor fields h^ u on M , Next, the boundary conditions 
can be written in the form of Equation ( 190 1 with a = 1. In order to compute the matrix coefficients 
c^ A b, it is convenient to decompose /i„„ 



h eAuv in terms of the basis vectors 



eia/3 



= K a Kp, e 2 af3 '■= —2K(aQp), e 3Q/ 3 := —2K^ a Q/ 3 - ) , e^ap := 2Q( a Qi3), 

= QaQ/3, eea/3 '■= QaQ/3, 

= —2L( a Qp- ) , e Sa p := — 2L^ a Q /3 - j , e 9a p := 2K( a L^, e Wa p := L a Lp, 



with h 1 = h LL /4, h 2 = h 3 = h LQ /4, h* = h QQ /A, h b = h 6 = h QQ /A, h 7 = h 8 = Hkq/4, 
h 9 = Iikl/4:, h w — IikkI^- With respect to this basis, the only nonzero matrix coefficients are 



C^2 = 



Q" 
Q" 
Q" 



r-^ 3 n 






c»\ 



rrs = Q^ 



= M 8 = 



^4_ _ 



o = -L», 



which has the required upper triangular form with zeroes in the diagonal. Therefore, the hypothesis 
of Theorem [8] are verified and one obtains a well posed IBVP for Einstein's equations in harmonic 
coordinates. 



This result also applies the the modified system ( 85 1 , since the constraint damping terms which 



are added do not modify the principal part of the main evolution system nor the one of the 
constraint propagation system. 



6.2 Boundary conditions for BSSN 



Here we discuss boundary conditions for the BSSN system (109 116), which is used extensively in 
numerical calculations of spacetimes describing dynamical black holes and neutron stars. Unfortu- 
nately, to date, this system lacks an initial-boundary value formulation for which well posedness in 
the full nonlinear case has been proven. Without doubt the reason for this relies in the structure 
of the evolution equations, which are mixed first/second order in space and whose principal part is 
much more complicated than the harmonic case, where one deals with a system of wave equations. 
A first step towards formulating a well posed IBVP for the BSSN system was performed 
in E 



where the evolution equations (|109 
= (4m 



110 



113-|116|) with a fixed shift and the relation 

Then, 



/ = fj, = (Am — l)/3 were reduced to a first order symmetric hyperbolic system. Then, a set 
of six boundary conditions consistent with this system could be formulated based on the theory 
of maximal dissipative boundary conditions. Although this gives rise to a well posed IBVP, the 
boundary conditions specified in |49j are not compatible with the constraints, and therefore, one 
does not necessarily obtain a solution to the full set of Einstein's equations beyond the domain 
of dependence of the initial data surface. In a second step, constraint-preserving boundary condi- 
tions for BSSN with a fixed shift were formulated in [218J, and cast into maximal dissipative form 
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for the linearized system (see also Ref. [13 ). However, even at the linearized level, these bound- 
ary conditions are too restrictive because they constitute a combination of Dirichlet and Neumann 
boundary conditions on the metric components, and in this sense they are totally reflecting instead 
of absorbing. More general constraint-preserving boundary conditions were also considered in 



and, based on the Laplace method, they were shown to satisfy the Lopatinsky condition (144 1. 



Radiative-type constraint-preserving boundary conditions for the BSSN system (109- 1161 with 



dynamical lapse and shift were formulated in |307j and shown to yield a well posed IBVP in 
the linearized case. The assumptions on the parameters in this formulation are m = 1, / > 0, 
K = 4GH/3 > 0, / 7^ K, which guarantee that the BSSN sys tem is strongly hyperbolic, and as long 



as e *" 7^ 2a they allow for the gauge conditions (119 120) used in recent numerical calculations 



where / = 2/ a and n — e */a , see Section 4.3.1 In the following, we describe this IBVP in more 
detail. First, we notice that the analysis in S ect ion |4 . 3 . 1 1 re veals that for the standard choice m = 1 
the characteristic speeds with respect to the unit outward normal s, to the boundary are 



(3 s ± a, P s ±a^f, 



/3 s ±a- 



where j3 s = /3 J s, is the normal component of the shift. According to the theory described in 
Scction[5]it is the sign of these speeds which determines the number of incoming fields and boundary 
conditions that must be specified. Namely, the number of boundary conditions is equal to the 
number of characteristic fields with positive speed. Assuming \f3 s \ is small enough such that 
| j3 s J a | < min{l, \/J, \/GH, ^/k}, which is satisfied asymptotically if /3 s — > and a — > 1, it is the 
sign of the normal component of the shift which determines the number of boundary conditions. 
Therefore, in order keep the number of boundary conditions fixed throughout evolution^ one has 
to ensure that either /3 s > or (3 s < at the boundary surface. If the condition f3 s — > is imposed 
asymptotically, the most natural choice is to set the normal component of the shift to zero at the 
boundary, (3 s = at T . The analysis in |49j then reveals that there are precisely nine incoming 
characteristic fields at the boundary, and thus, nine conditions have to be imposed at the boundary. 
These nine boundary conditions are as follows: 

• Boundary conditions on the gauge variables 

There are four conditions that must be imposed on the gauge functions, namely the lapse and 
shift. These conditions are motivated by the linearized analysis, where the gauge propagation 
system, consisting of the evolution equations for lapse and shift obtained from the BSSN 



equations (109 112 1161, decouples from the remaining evolution equations. Surprisingly, 



this gauge propagation system can be cast into symmetric hyperbolic form [307 , for which 
maximal dissipative boundary conditions can be specified, as described in Section |5.2[ It 
is remarkable that the gauge propagation system has such a nice mathematical structure, 



since the equations (109 



111 



112) have been specified by hand and mostly motivated by 



numerical experiments instead of mathematical analysis. 



In terms of the operator II 1 



s % Sj projecting onto vectors tangential to the boundary, 



the four conditions on the gauge variables can be written as 



s l d l a = 0, 

13 s =0, 



IT 



d t 



'3k 



8 k dk) & = - F ?—& j ? k d k a. 

J f - K 



(209) 
(210) 

(211) 



Equation (209) is a Neumann boundary condition on the lapse, and Equation (210) sets 



the normal component of the shift to zero, as explained above. Geometrically, this implies 



26 Notice that this is one of the requirements of Theorem [7l for instance, where the boundary matrix must have 
constant rank in order for the theorem to be applicable. 
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that the boundary surface T is orthogonal to the time slices S t . The other two conditions in 



Equation ( 211 ) are Sommerfeld-like boundary conditions involving the tangential components 



of the shift and the tangential derivatives of the lapse; they arise from the analysis of the 



characteristic structure of the gauge propagation system. An alternative to Equation (211) 



also described i n |307j is to set the tangential components of the shift to zero, which, together 



with Equation (210) is equivalent to setting (3 l = at the boundary. This alternative may 
be better suited for IBVP with non-smooth boundaries, such as cubes, where additional 
compatibility conditions must be enforced at the edges. 

• Constraint-preserving boundary conditions 

Next, there are three conditions requiring that the momentum constraint be satisfied at the 
boundary. In terms of the BSSN variables this implies 



2^ 

— j 

3 



IPAn - -D % K + 6ly^> = 8ttG nJi . (212) 



As shown in [307] . Equations (212) yields homogeneous maximal dissipative boundary con- 



ditions for a symmetric hyperbolic first order reduction of the constraint propagation sys- 



tem (126 127 128). Since this system is also linear and its boundary matrix has constant 
rank if j3 s = 0, it follows from Theorem [7] that the propagation of constraint violations is 
governed by a well posed IBVP. This implies, in particular, that solutions whose initial data 
satisfy the constraints exactly automatically satisfy the constraints on each time slice S t . 
Furthermore, small initial constraint violations which are usually present in numerical appli- 
cations yield solutions for which the growth of the constraint violations can be bounded in 
terms of the initial violations. 

• Radiation controlling boundary conditions 

Finally, the last two boundary conditions are intended to control the incoming gravitational 



radiation, at least approximately, and specify the complex Weyl scalar \l/o, cf. Example 32 
In order to describe this boundary condition we first define the quantities £y := Rij 4 

K 



i :i 



(| KAij - A u 4) - AnGNaij and B kij := e 4 ^ [b k A tJ - 4 (b {i </») A j)k 



which 



k-i 



determine the electric and magnetic parts of the Weyl tensor through E ls = £jj — g7ij7 £; 
and Bij — e k i(iB kl j) 1 respectively. Here, e k ij denotes the volume form with respect to the 
three metric 7^. In terms of the operator P U / TO = H l nlP m ) — in y Tl; m projecting onto 
symmetric trace-less tangential tensors to the boundary, the boundary condition reads 

P ImSij + \ s P Im ~ s P i m J Bkij — P Im^iji (213) 

with Gij a given smooth tensor field on the boundary surface T. The relation between Gij 
and $o is the following: if n = a^ 1 (dt — f3 l di) denotes the future-directed unit normal to 
the time slices, we may construct an adapted Newman-Penrose null tetrad {K,L,Q,Q} at 
the boundary by defining K := n + s, L := n — s, and by choosing Q to be a complex 
null vector orthogonal to K and L, normalized such that Q^Q^ — 2. Then, we have ^o = 
(Eki — iP>ki)Q k Q l = GkiQ k Q l ■ For typical applications involving the modeling of isolated 
systems one may set Gij to zero. However, this in general is not compatible with the initial 



data (see the discussion in Section 10.3), an alternative is then to freeze Gij its value to the 
one computed from the initial data. 



The boundary condition (213) can be partially motivated by considering an isolated system 
which, globally, is described by an asymptotically flat spacetime. Therefore, if the outer 
boundary is placed far away enough from the strong field region, one may linearize the field 
equations on a Minkowski background to a first approximation. In this case, one is in the 



8-1 



same situation as in Example |32[ where the Weyl scalar \J/ is an outgoing characteristic 
field when constructed from the adapted null tetrad. Furthermore, one can also appeal to 
the peeling behavior of the Weyl tensor |319] in which \Po is the fastest decaying component 
along an outgoing null geodesies and describes the incoming radiation at past null infinity. 
While VPo can only be defined in an unambiguous way at null infinity, where a preferred null 



tetrad exists, the boundary condition (2131 has been successfully numerically implemented 
and tested for truncated domains with artificial boundaries in the context of the harmonic 
formulation, see for example 354 . Estimates on the amount of spurious reflection introduced 
by this condition have also been derived in (S3 US] , see also 



6.3 Geometric existence and uniqueness 

The results mentioned so far concerning the well posed IBVP for Einstein's field equations in 
the tetrad formulation of 181 , in the metric formulation with harmonic coordinates described 
in Section |6.1[ or in the linearized BSSN formulation described in Section |6.2| allow, from the 
PDE point of view, to construct unique solutions on a manifold of the form M = [0, T] x £ 
given appropriate initial and boundary data. However, since general relativity is a diffeomorphism 
invariant theory, one needs to pose the IBVP from a geometric perspective. In particular, the 
following questions arise, which, for simplicity, we only formulate for the vacuum case: 

• Geometric existence. Let (M, g) be any smooth solution of Einstein's vacuum field equa- 
tions on the manifold M = [0, T]xS corresponding to initial data (h, k) on £ and boundary 
data ip on T, where h and k represent, respectively, the first and second fundamental forms 
of the initial surface So as embedded in (M, <?). Is it possible to reproduce this solution with 
any of the well posed IBVP mentioned so far, at least on a submanifold M' — [0, T'] x £ 
with < T' < Tl That is, does there exist initial data / and boundary data q for this IBVP 
and a diffeomorphism <\> : M' — > 4>(M') C M which leaves £o and T' invariant, such that the 
metric constructed from this IBVP is equal to <jfg on M'l 

• Geometric uniqueness. Is the solution (M,g) uniquely determined by the data (h,k,ip)l 
Given a well posed IBVP for which geometric existence holds, the question about geometric 
uniqueness can be reduced to the analysis of this particular IBVP in the following way: let 
Mi and ui be two solutions of the IBVP on the manifold M = [0, T] x £ with corresponding 
data (/i,<7i) and (/2,<?2)- Suppose the two solutions induce the same data (h,k) on £o and 
-0 on T. Does there exist a diffeomorphism cf) : M' — [0, T'] x £ — >• cj)(M') C M which leaves 
Eo and T' invariant, such that the metrics g\ and gi corresponding to ui and ui are related 
to each other by gi = 4>*gi on M'l 

These geometric existence and uniqueness problems have been solved in the context of the 



Cauchy problem without boundaries, see |124) and Section 4.1.3 When boundaries are present, 
however, several new difficulties appear as pointed out in |180j . see also Refs. |181| 1179] : 

(i) It is a priori not clear what the boundary data ip should represent geometrically. Unlike the 
case of the initial surface, where the data represents the first and second fundamental forms 
of Eo as a spatial surface embedded in the constructed spacetime (M, g) , it is less clear what 
the geometric meaning of iJj should be since it is restricted by the characteristic structure of 
the evolution equations, as discussed in Section [5J 



(ii) The boundary data (qx, qL,qQ, Qqq) m the boundary conditions (202 203 204 205) for the 
harmonic formulation and the boundary data Gij in the boundary condition (213) for the 
BSSN formulation ultimately depend on a specific choice of a future-directed time-like vector 
field T at the boundary surface T ■ Together with the unit outward normal N to T, this 
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vector defines the preferred null directions K = T + N and L = T — N which are used to 
construct the boundary adapted null tetrad in the harmonic case and the projection operators 
n^„ = <^„ + T»T V - N^N U and P^ al3 = n^II^ - §IP"II a/ 3 in the BSSN one. Although 
it is tempting to define T as the unit, future-directed time- like vector tangent to T which is 
orthogonal to the cross sections 9E t , this definition would depend on the particular foliation 
S t the formulation is based on, and so the resulting vector T would be gauge-dependent. A 
similar issue arises in the tetrad formulation of Ref. |181| . 

(iii) When addressing the geometric uniqueness issue, an interesting question is whether or not it 
is possible to determine from the data sets (/i, <Zi) and (/2, (fe) alone if they are equivalent in 
the sense that their solutions ui and «2 induce the same geometric data (h, k, ip). Therefore, 
the question is whether or not one can identify equivalent data sets by considering only 
transformations on the initial and boundary surfaces So and T, without knowing the solutions 
U\ and 1*2- 

Although a complete answer to these questions remains a difficult task, there has been some 
recent progress towards their understanding. In [180] a method was proposed to geometrically 
single out a preferred time direction T at the boundary surface T . This is done by considering the 
trace-free part of the second fundamental form, and proving that under certain conditions which 
are stable under perturbations, the corresponding linear map on the tangent space possesses a 
unique time-like eigenvector. Together with the unit outward normal vector N, the vector field T 
defines a distinguished adapted null tetrad at the boundary, from which geometrical meaningful 
boundary data could be defined. For instance, the complex Weyl scalar ^0 can then be defined 
as the contraction ^0 = C a p^sK a Q^K 1 Q s of the Weyl tensor Cap^g associated to the metric 
g^v along the null vectors K and Q, and the definition is unique up to the usual spin rotational 
freedom Q i-> e ltp Q, and therefore, the Weyl scalar ^q is a good candidate for forming part of the 
boundary data ip. 

In [341] it was suggested that the unique specification of a vector field T may not be a funda- 
mental problem, but rather the manifestation of the inability to specify a non-incoming radiation 
condition correctly. In the linearized case, for example, setting to zero the Weyl scalar \&o com- 
puted from the boundary adapted tetrad is transparent to gravitational plane waves traveling 
along the specific null direction K = T + N, see Example [32] but it induces spurious reflections 
for outgoing plane waves traveling in other null direction. Therefore, a genuine non-incoming ra- 
diation condition should be, in fact, independent of any specific null or time-like direction at the 
boundary, and can only depend on the normal vector N. This is indeed the case for much simpler 
systems like the scalar wave equation on a Minkowski background 149 , where perfectly absorbing 
boundary conditions are formulated as a nonlocal condition which is independent of a preferred 
time direction at the boundary. 

Aside from controlling the incoming gravitational degrees of freedom, the boundary data tp 
should also comprise information related to the geometric evolution of the boundary surface. 
In [181] this was achieved by specifying the mean curvature of T as part of the boundary data. In 
the harmonic formulation described in Section [6. 1| this information is presumably contained in the 
functions qj<, <Zl and <?q, but their geometric interpretation is not clear. 

In order to illustrate some of the issues related to the geometric existence and uniqueness 
problem in a simpler context, in what follows we analyze the IBVP for linearized gravitational 
waves propagating on a Minkowski background. Before analyzing this case, however, we make 
two remarks. First, it should be noted |180j that the geometric uniqueness problem, especially an 
understanding of point (iii), also has practical interest, since in long term evolutions it is possible 
that the gauge threatens to break down at some point, requiring a redefinition. The second remark 
concerns the formulation of the Einstein IBVP in generalized harmonic coordinates, described in 



Sections 4.1 and 6.1 where general covariance was maintained by introducing a background metric 
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g on the manifold M. IBVPs based on this approach have been formulated in [357) and [257] 
and further developed in |422j and [421] . However, one has to emphasize that this approach does 
not automatically solve the geometric existence and uniqueness problems described here: although 
it is true that the IBVP is invariant with respect to any diffeomorphism <f> : M — > M which acts 
on the dynamical and the background metric at the same time, the question on the dependency 
of the solution on the background metric remains. 



6.3.1 Geometric existence and uniqueness in the linearized case 

Here we analyze some of the geometric existence and uniqueness issues of the IBVP for Einstein's 
field equations in the much simpler setting of linearized gravity on Minkowski space, where the 
vacuum field equations reduce to 



VVuhap-VaVfih- 



•2VV( a ft 



«M 



0, 



(214) 



where h a p denotes the first variation of the metric, h := r\ a °h a p its trace with respect to the 
Minkowski background metric r\ a p , and V M is the covariant derivative with respect to r\ a p . An in- 
finitesimal coordinate transformation parametrized by a vector field £ p induces the transformation 



T-a/3 



i-)- h, 



a/3 



la/3 



•2V( Q £y3), 



(215) 



where £ Q := rj a p^. 

Let us consider the linearized Cauchy problem without boundaries first, where initial data is 
specified at the initial surface So = {0} x M 3 . The initial data is specified geometrically by the 
first and second fundamental forms of Sq, which in the linearized case are represented by a pair 



,(«) 1.(0) 



(h\j ,k\j) of covariant symmetric tensor fields on Sq. We assume (h\ • , A:).- ) to be smooth and to 
satisfy the linearized Hamiltonian and momentum constraints 



,(0h 



GP^didjhpJ = 0, 



G l3rs d 3 kl°J = 0, 



(216) 



where G lJ>s := (5 l ( r (5 s )J — 5'^8 rs . A solution h a B of Equation (2141 with the induced data corre- 



sponding to (h\° ', k\° ') 



up to a gauge transformation (215) satisfies 



"ylE 



,(°) 



28,,X 



(i^j), 



d t hij 



2d (i h J)o\ So 



,(<>) 



2{kW+d i d j f), 



(217) 



where Xj — £j and / = £o are smooth and represent the initial gauge freedom. Then, one has: 



Theorem 9. The initial-value problem {'214: 211) possesses a smooth solution h a p which is unique 



up to an infinitesimal coordinate transformation h a p = h a p + 2V i a £,/3) generated by a vector field 



Proof. We first show the existenc e of a solution in the linearized harmonic gauge Cp — V^hp^ — 
i^Vg/i = 0, for which Equation (214) reduces to the system of wave equations V^V^/ic^ = 0. 

dth 



(0) 



The initial data, {h a p\^ , 9t/i Q/ 9| E ), for this system is chosen such that /iy| E = n^ , utriijls 

2%/ l , )0 | So - 2fcg> and °d t h 0Q \^ = 2^*<°>, d t h 0j \^ = &(hf - \Sy6 kl h { $) + \ 9>oo| Eo , where 
the initial data for /loo & n d hoj is chosen smooth but otherwise arbitrary. This choice implies the 
satisfaction of Equation (217) with Xj — and / = and the initial conditions C^L =0 and 
<9tC^| s = on the constraint fields Cp. Therefore, solving the wave equation V^V fj,h a p — with 



such data, we obtain a solution of the linearized Einstein equations (214) in the harmonic gauge 
with initial data satisfying (217) with Xj — and / = 0. This shows geometric existence for the 



linearized harmonic formulation. 
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As for uniqueness, suppose we had two smooth solutions of Equations (214 217 1. Then 



since the equations are linear, the difference h a p between these two solutions also satisfies the 

,(0) 



Equations (214 



■„(0) 



217| with trivial data h\j = 0, k\j = 0. We show that /i Q/ g can be transformed 



away by means of an infinitesimal gauge transformation ( 215 ) . For this, define h a fj := /i Q/ 3+2V( Q £^) 



where £^ is required to satisfy the inhomogeneous wave equation 







V a h a p - -Vph 



W a h 



1. 



a/3 



-Vph + V a V a & 



with initial data for £g defined by f | So = /, £i| So = ~ X " 5 *£o| So = -h ao /2, <9 t &| So = -h Qi +dif. 
Then, by construction, h a p satisfies the harmonic gauge, and it can be verified that h a p = 



9th 



't'l-afS 



— 0. Therefore, h a p is a solution of the wave equation V M V M /i Q ^ = with trivial initial 



data, and it follows that h a p = and that h a p = — 2V( a £fi) is a pure gauge mode. □ 

It follows from the existence part of the proof that the quantities ft-ools an d ^ojls > corre- 
sponding to linearized lapse and shift, parametrize pure gauge modes in the linearized harmonic 
formulation. 

Next, we turn to the IBVP on the manifold M = [0,T] x E. Let us first look at the boundary 



conditions ( 202 205 1 , which, in the linearized case reduce to 



^ k^kk\-i — Qk, 



VkIiklIj- 



Ql, 



V 'kJikqIt 
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^xh 



KllQQ 
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There is no problem in repeating the geometric existence part of the proof on M imposing this 
boundary condition, and using the IBVP described in Section |6.1| However, there is a problem 



when trying to prove the uniqueness part. This is because a gauge transformation (2151 induces 
the following transformations on the boundary data, 



qi< = Qk 
VQQ = 



2V\i 



k$k, qL = qL+ V K fz, + V K ^Uk, 

Vq(V^q-V q ^), 



qq = qq + Vk$Q + VkVqZk, 



which overdetermines the vector field £p at the boundary. On the other hand, replacing the 



boundary condition (205) by the specification of the Weyl scalar ^o, leads to |278|, 1357 



^K h QQ +^Q{^QhKK 



2V K h K Q) 



IT 



*0- 



(218) 



Since the left-hand side is gauge-invariant, there is no overdetermination of ^ at the boundary any 
more, and the transformation properties of the remaining boundary data qx, qh and qQ provides 
a complete set of boundary data for £&-, £/, and £q which may be used in conjunction with the 
wave equation V^V^^g = in order to formulate a well-posed IBVP [357] . Provided *$>o is smooth 
and the compatibility conditions are satisfied at the edge S — Eq H T, it follows: 



Theorem 10. 



The IBVP (214 



211 



up to an infinitesimal coordinate transformation h a p 



218) possesses a smooth solution h a p which is unique 
h a p 



2V( Q £^) generated by a vector field 



In conclusion, we can say that in the simple case of linear gravitational waves propagating a 
Minkowksi background we have resolved the issues (i— iii) . Correct boundary data is given to the 
linearized Weyl scalar Wo computed from the boundary adapted tetrad. To linear order, Wo is 
invariant with respect to coordinate transformations, and the time-like vector field T appearing in 
its definition can be defined geometrically by taking the future-directed unit normal to the initial 
surface Eq and parallel transport it along the geodesies orthogonal to Eg. 



Whether or not this result can be generalized to the full nonlinear case it not immediately 
clear. In our linearized analysis we have imposed no restrictions on the normal component £/v of 
the vector field generating the infinitesimal coordinate transformation. However, such a restriction 
is necessary in order to keep the boundary surface fixed under a diffeomorphism. Unfortunately, 
it does not seem possible to restrict £jv in a natural way with the current boundary conditions 
constructed so far. 

6.4 Alternative approaches 

Although the formulation of Einstein's equations on a finite space domain with an artificial time- 
like boundary is currently the most used approach in numerical simulations, there are a number 
of difficulties associated with it. First, as discussed above, spurious reflections from the boundary 
surface may contaminate the solution unless the boundary conditions are chosen with great care. 
Second, in principle there is a problem with wave extraction, since gravitational waves can only 
be defined in an unambiguous (gauge-invariant) way at future null infinity. Third, there is an 
efficiency problem, since in the far zone the waves propagate along outgoing null geodesies so that 
hyperboloidal surfaces which are asymptotically null should be better adapted to the problem. 
These issues have become more apparent as numerical simulations have achieved higher accuracy 
to the point that boundary and wave extraction artifacts are noticeable, and have driven a number 
of other approaches. 

One of them is that one of compactification schemes which include spacelike or null infinity 
into the computational domain. For schemes compactifying spacelike infinity, see Refs. [324, 325 . 
Conformal compactifications are reviewed in Refs. [1681 1178] , and a partial list of references to date 
includes [5151 H72l IT731 H751 11751 ITMl [2401 H68l 12551 1571 14331 14841 [3T)8l [84l 14381 [43m [4851 l48Bl 14371 
12571 15521 1301 I4T]. 

Another approach is Cauchy-characteristic matching (CCM) [5J] \2M HM QUI Q35J [52] , which 
combines a Cauchy approach in the strong field regime (thereby avoiding the problems that the 
presence of caustics would cause on characteristic evolutions) with a characteristic one in the wave 
zone. Data from the Cauchy evolution is used as inner boundary conditions for the characteristic 
one and, viceversa, the latter provides outer boundary conditions for the Cauchy IBVP. An un- 
derstanding of the Cauchy IBVP is still a requisite. CCM is reviewed in [4201 1420] . A related idea 
is Cauchy-perturbative matching [4421 13441 [5J 13581 [5J , where the Cauchy code is instead coupled 
to one solving gauge-invariant perturbations of Schwarzschild black holes or flat spacetime. The 
multipole decomposition in the Regge-Wheeler-Zerilli equations 336, 440, 364. 286, 299 implies 
that the resulting equations are 1+1 dimensional and can therefore extend the region of integration 
to very large distances from the source. As in CCM, an understanding of the IBVP for the Cauchy 
sector is still a requisite. 

One way of dealing with the ambiguity of extracting gravitational waves from Cauchy evo- 
lutions at finite radii is by extrapolating procedures, see for example (TOJ, 1322] for some ap- 
proaches and quantification of their accuracies. Another approach is Cauchy characteristic ex- 
traction (CCE) [5551 I55JI5571I5T1I5D1 155]. In CCE a Cauchy IBVP is solved, and the numerical 
data on a world tube is used to provide inner boundary conditions for a characteristic evolution 
that "transports" the data to null infinity. The difference with CCM is that in CCE there is no 
"feedback" from the characteristic evolution to the Cauchy one, and the extraction is done as a 
post-processing step. 
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7 Numerical Stability 

In the previous sections we have discussed continuum initial and initial-boundary value problems. 
In this section we start with the study of the discretization of such problems. In the same way that 
a PDE can have a unique solution yet be ill posecrj a numerical scheme can be consistent yet not 
convergent due to the unbounded growth of small perturbations as resolution is increased. The 
definition of numerical stability is the discrete version of well posedness. One wants to ensure that 
small initial perturbations in the numerical solution, which naturally appear due to discretization 
errors and finite precision, remain bounded for all resolutions at any given time t > 0. Due to the 
classical Lax-Richtmyer theorem |268j . this property, combined with consistency of the scheme, is 
equivalent in the linear case to convergence of the numerical solution, and the latter approaches 
the continuum one as resolution is increased (at least within exact arithmetic). Convergence of a 
scheme is in general difficult to prove directly, especially because the exact solution is in general 
not known. Instead, one shows stability. 

The different definitions of numerical stability follow those of well posedness, with the I? norm 
in space replaced by a discrete one, which is usually motivated by the spatial approximation. For 
example, discrete norms under which the Summation By Parts property holds are natural in the 
context of some finite difference approximations and collocation spectral methods (see Sections ^ 
and [9}. 

We start with a general discussion of some aspects of stability, and explicit analyses of simple, 
low order schemes for test models. There follows a discussion of different variations of the von 
Neumann condition, including an eigenvalue version which can be used to analyze in practice 
necessary conditions for initial-boundary value problems. Next, we discuss a rather general stability 
approach for the method of lines, the notion of time-stability, Runge-Kutta methods, and we close 
the section with some references to other approaches not covered here, as well as some discussion 
in the context of numerical relativity. 

7.1 Definitions and examples 

Consider a well posed linear initial value problem (see Definition [3]) 

u t (t,x) = P(t,x,d/dx)u(t,x), iel°, £>0, (219) 

u(0,x) = f(x), x eR n . (220) 



Definition 11. An approximation- discretization to the Cauchy problem (219. 220) is numerically 
stable if there is some discrete norm in space \\ ■ ||<j and constants K d , a d such that the corresponding 
approximation v satisfies 

IKV)Hd<^de Qit ||/|| d , (221) 

for high enough resolutions, smooth initial data f , and t > 0. 
Note: 
• The previous definition applies both to the semidiscrete case (where space but not time is 



discretized) as well as the fully discrete one. In the latter case, Equation (221) is to be 
interpreted at fixed time. For example, if the timestep discretization is constant, 

t k = kAt, A: = 0,1,2... 



then Equation (221) needs to hold for fixed tk and arbitrarily large k. In other words, the 
solution is allowed to grow with time, but not with the number of timestcps at fixed time 
when resolution is increased. 



7 See, for instance Example 6 with initial data / £ S u 
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• The norm || • || d in general depends on the spatial approximation, and in Sections ^ and M we 
discuss some definitions for the finite difference and spectral cases. 



• 



From Definition [TT1 one can see that an ill posed problem cannot have a stable discretization 



since otherwise one could take the continuum limit in (221 ) and reach the contradiction that 
the original system was well posed. 



• As in the continuum, Equation (221 ) implies uniqueness of the numerical solution v. 



• 



In Section [3] wc discussed that if in a well posed homogeneous Cauchy problem a forcing term 



is added to Equation (2191 



Ut(t,x)=P(t,x,d/dx)u(t,x) ^ u t (t,x) = P(t,x,d/dx)u(t,x) + F(t,x) 1 (222) 

then the new problem admits another estimate, related to the original one via Duhamel's 



formula, Equation (15). A similar concept holds at the semidiscrete level, and the discrete 
estimates change accordingly (in the fully discrete case the integral in time is replaced by a 
discrete sum), 

IKV)Hd<^de Qd 1/||d * |KV)lld<^de Qdt l\\f\\ d + J\\F(s,-)\\ d ds . (223) 

In other words, the addition of a lower order term does not affect numerical stability, and 
without loss of generality one can restrict stability analyses to the homogeneous case. 

• The difference w := u — v between the exact solution and its numerical approximation 



satisfies an equation analogous to (222 1, where F is related to the truncation error of the 



approximation. If the scheme is numerically stable, then in the linear and semidiscrete cases 



Equation (223) implies 

\\w(t,-)\\ d <K d e a ^ I \\F(s,-)\\ d ds. (224) 



If the approximation is consistent, the truncation error converges to zero as resolution is 



increased, and Equation (224) implies that so does the norm of the error \\w(t, -)|| d . That is, 
stability implies convergence. The inverse is also true and this equivalence between conver- 
gence and stability is the celebrated Lax-Richtmyer theorem. The equivalence also holds in 
the fully discrete case. 



• In the quasi- linear case one follows the principle of linearization, as described in Section |3.3| 
One linearizes the problem, and constructs a stable numerical scheme for the linearization. 
The expectation, then, is that the scheme also converges for the nonlinear scheme. For 
particular problems and discretizations this expectation can be rigorously proved (see, for 
example, |253J). 

From hereon {xj,tk} denotes some discretization of space and time. This includes both finite 
difference and spectral collocation methods, which are the ones discussed in Sections [8] and [9] 
respectively. In addition, we use the shorthand notation 

Vj := v(tk,Xj). 
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In order to gain some intuition into the general problem of numerical stability we start with 
some examples of simple, low order approximations for a test problem. Consider uniform grids 
both in space and time 

t k = kAt, Xj=jAx, k = 0,1,2,..., j = 0,1, 2,... TV, 

and the advection equation, 

u t = au x , xE [0, 2tt], t> 0, (225) 

on a periodic domain with 2n = NAx, and smooth periodic initial data. Then the solution u can 
be represented by a Fourier series: 

u(t,x) = -±=Y i uit,u)<* MI , (226) 

where 

1 /-27T 



u(t,u) = -= e' lulx u{t,x)dx, 

V 2tt Jo 

and the stability of the following schemes can be analyzed in Fourier space. 
Example 33. The one-sided Euler scheme. 



Equation ( 225 1 is discretized with a one-sided finite difference approximation for the spatial 



derivative and evolved in time with the forward Euler scheme, 

At Ax 

In Fourier space the approximation becomes 

v k+1 (Lu) = q(uj)v k (uj) = [q»f +1 w°(w) , (227) 

where 

g(w) = 1 + aX (e wAx - l) 

is called the amplification factor and 

Ax 
the Courant Friedrich Levy (CFL) factor. 
Using Parseval's identity, we find 



;(i fe ,.)|| 2 = El^)| 2fc |-°H| 2 ' 



uei 



and therefore, we see that the inequality (221 1 can only hold for all k if 

\q(oj)\ < 1 for all weZ. (228) 

For a > 0, this is the case if and only if the CFL factor satisfies 

< A < -, (229) 

a 



and in this case the well posedness estimate (221 1 holds with K d = 1 and a d = 0. The upper bound 



in condition (229) for this example is known as the CFL limit, and (2281 as the von Neumann 
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condition. If a = 0, q(uj) — 1, while for a < the scheme is unconditionally unstable even 
though the underlying continuum problem is well posed. 

Next we consider a scheme very similar to the previous one, but which turns out to be uncon- 
ditionally unstable for a^O, regardless of the direction of propagation. 

Example 34. A centered Euler scheme. 

ion to Equation (225), 

(230) 



Consider first the semidiscrete approximation to Equation (225), 

d _ Vj+i - Vj-i 



dt J 2Ax 

it is easy to check that it is stable for all values of Ax. Next discretize time through an Eulcr 
schcmc, leading to 

±]zA = a v j±i^U. (231) 

At 2Ax v ' 

The solution again has the form given by Equation ( |227[ ), now with 

|g(w)| = |l + iaAsin(wAx)| > 1. 



At fixed time tk, the norm of the solution to the fully discrete approximation (231) for arbitrary 
small initial data with ujAx ^ 7rZ grows without bound as the timestep decreases. 



The semidiscrete centered approximation ( 230 ) and the fully discrete centered Euler scheme 



(231 ) constitute the simplest example of an approximation which is not fully discrete stable even 



though its semidiscrete version is. This is related to the fact that the Euler time integration is not 



locally stable, as discussed in Section 7.3.2 



The previous two examples were one-step methods, where v k+1 can be computed in terms 
of v k . The following is an example of a two-step method. 

Example 35. Leap-frog. 

A way to stabilize the centered Euler scheme is by approximating the time derivative by a centered 

difference instead of a forward, one-sided operator: 

Enlarging the system by introducing 



2iaXsiii(ujAx) 1 
1 



"■ : ^ < »*- 



J » V- 



it can be cast into the one-step method 

w k+1 = Q(oj)w k = Q(u) k+1 w°, with Q(uj) = 



By a similar procedure, a general multi-step method can always be reduced to a one-step one. 
Therefore, in the stability results below we can assume without loss of generality that the schemes 
are one-step. 

In the above example the amplification matrix Q(u>) can be diagonalizcd through a transfor- 
mation that is uniformly bounded: 

OM = T{u) ( ^ ®_ \ T-V), Q{u) k = f (w) ( f f ° k ) f-H<o), 
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.V2 



with /j,± = z ± (1 + z 2 ) , z := iaAsin (a; Ax), and 



T( U ) 



1 1 



The eigenvalues /i-t are of unit modulus, |/x±| = 1. In addition, the norms of T(w) and its inverse 
are 

|f(«)| = V2(i + N) , It^MI 



1 



V2(i-M) 

Therefore the condition number of T(w) can be bounded for all u: 

'l + U 



|T(w)| • \T-i(u)\ = 



1/2 n + HA\ 1/2 

1- a A/ 



provided that 



A< 



(232) 



and it follows that the Leap-frog scheme is stable under the condition ( 232 1 



The previous examples were explicit methods, where the solution v, +1 (or w ? +1 ) can be 
explicitly computed from the one at the previous timestep, without inverting any matrices. 

Example 36. Crank-Nicholson. 



Approximating Equation (225 I by 



i-4^°H ' 



At * , 



with 



D Vj 



2Ax 



(v. 



3+1 



V 3-l) 



(233) 



defines an implicit method. Fourier transform leads to 



1 — ia— sin(ujx) 
2 v ; 



-k+l 



1 + ia— sin(wa;) 



The expressions inside the square brackets on both sides are different from zero and have equal 
magnitude. As a consequence, the amplification factor in this case satisfies 



\q(u)\ = 1 for all u G Z and A > 0, 



(234) 



and the scheme is unconditionally stable at the expense of having to invert a matrix to advance 
the solution in time. 

Example 37. Iterated Crank-Nicholson. 

Approximating the Crank-Nicholson scheme through an iterative scheme with a fixed number of 



iterations is usually referred to as the Iterated Crank-Nicholson (ICN) method. For Equation ( 225 ) 
it proceeds as follows 



First iteration: an intermediate variable ^>v is calculated using a second order in space 



centered difference (2331 and an Euler, first order forward time approximation, 



9-1 



Next, a second intermediate variable is computed through averaging, 
The full time step for this first iteration is 



• Second iteration: it follows the same steps. Namely, the intermediate variables 

1 
At 



l^? +l -v?) = Wr 



3 



(2)-+l/2 = l^( 2 ).„ +1+u „ 



are computed, and the full step is obtained from 



1 
At 



(«. 



n+1 



Do^e 



(2)-r ! + l/2 



• Further iterations proceed in the same way. 



The resulting discretization is numerically stable for A < 2/a and p — 2,3, 6, 7, 10, 11, . . . iterations, 
and unconditionally unstable otherwise. In the limit p — ¥ oo the ICN scheme becomes the implicit, 
unconditionally stable Crank-Nicholson scheme of the previous example. For any fixed number 
of iterations, though, the method is explicit and stability is contingent on the CFL condition 
A < 2/a. The method is unconditionally unstable for p = 4,5, 8, 9, 12, 13, . . . because the limit of 



the amplification factor approaching one in absolute value [cf. Equation (234)] as p increases is not 



monotonic. See |402j for details and |368j for a similar analysis for "theta" schemes. 

Similar definitions to that one of Definition ITT] are introduced for the initial-boundary value 
problem. For simplicity we explicitly discuss the semi-discrete case. In analogy with the definition 
of a strongly well posed IBVP (Definition |9| one has 

Definition 12. A semi- discrete approximation to the linearized version of the IBVP (12£\ 13C\ 
131) is numerically stable if there are discrete norms || ■ || d at £ and \\ ■ \\g d at 9E and constants K d = 
K d (T) and e d = e^(T) > such that for high enough resolution the corresponding approximation 
v satisfies 



Ht,-)\\l 



Sd 



\v{s,-)\\l A ds<Kl 



IdA 1 



f\\l+ (\\F(^-)\\l + \\g(s,-)\ 



a,d) 



(235) 



for all t € [0, T] . If the constant e d can be chosen strictly positive, the problem is called strongly 
stable. 

In addition, the semi-discrete version of Definitions [6] and [7] lead to the concepts of strong 
stability in the generalized sense and boundary stability, respectively, which we do not write down 
explicitly here. The definitions for the fully discrete case arc similar, with time integrals such as 



those in Equation (235) replaced by discrete sums. 
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7.2 The von Neumann condition 



Consider a discretization for a linear system with variable in space but time- independent coefficients 
such that 



_fc+i 



Qv fe , 
f, 



(236) 
(237) 



where v fc denotes the gridfunction v fe = {v 1 - : j = 0, 1, . . . , N} and Q is called the amplification 
matrix. We assume that Q is also time-independent. Then 

v fc = Q fe f 



and the approximation ( 236 237 1 is stable if and only if there are constants K d and a d such that 

||Q fc ||d<^de Qdtfc 



(238) 



for all k = 0, 1, 2, . . . and high enough resolutions. 



In practice, condition (2381 is not very manageable as a way of determining if a given scheme 



is stable since it involves computing the norm of the power of a matrix. A simpler condition based 
on the eigenvalues {qi} of Q as opposed to the norm of Q fc is von Neumann's one: 



<e Qd 



Al 



for all eigenvalues g, of Q and all At > 0. 



(239) 



This condition is necessary for numerical stability: if qt is an eigenvalue of Q, q\ is an eigenvalue 
of Q fc and 



k*|<IIQll<^de c 



K d e 



a d kAt 



That is, 



\Qi\ < K 



l/k a d At 



which, in order to be valid for all k, implies Equation (2391 



As already mentioned, in order to analyze numerical stability one can drop low order terms. 
Doing so typically leads to Q depending on At and Ax only through a quotient (the CFL factor) 
of the form (with p = 1 for hyperbolic equations) 



A 



At 

(Ax)p ' 



Q(At,Aa;) = Q(A). 



(240) 



Then for Equation (2391 to hold for all At > while keeping the CFL factor fixed (in particular, 



for small Ai > 0), the following condition has to be satisfied: 

\qi\ < 1 for all eigenvalues qi of Q, (241) 

and one has a stronger version of the von Neumann condition, which is the one encountered in 



Example 33 see Equation ( 228 1 



7.2.1 The periodic, scalar case 

We return to the periodic scalar case, such as the schemes discussed in Examples [33j [34j [35j and 
[36| with some more generality. Suppose then, in addition to the linearity and time-independent 
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assumptions of the previous subsection, that the continuum problem, initial data and discretiza- 



tion (236 237) are periodic on the interval [0, 27t]. Through a Fourier expansion we can write the 



grid function f = (/(xo), /(xi), . . . , /(xjv)) corresponding to the initial data as 



where e l " = (e lulXo , e luXl , . . . , e tuXN ). The approximation becomes 

v fc = -L V f(u))Q k e iu . 

Assuming that Q is diagonal in the basis e lu , such that 

Qe 4W = g(w)e iw 
as is many times the case, we obtain, using ParsevaPs identity, 



iv fc n = (x;i/Mi 2 i«M 



1/2 



A- 1 2 



WG2 



If 



\q(ui)\ < e QdA * for all eigenvalues ui e Z and At > 0, 
for some constant ad then 



|v K ||<e 



ct^kAt 



E i/(* 



1/2 



e a d fcAt|| f |, -e^tnllfll 



(242) 



(243) 



(244) 



and stability follows. Conversely, if the scheme is stable and (2431 holds, (244 1 has to be satisfied: 
take 

f = e iw , 



then 



or 



|q fc H|||f|| = ||v fe ||<A>^||f||, 



|$MI<# d v V 



equivalent to the requirement ( 244 ) on the eigenvalues of Q 



for arbitrary k, which implies (244). Therefore, provided the condition (243) holds, stability is 



7.2.2 The general, linear, time-independent case 

However, as mentioned, the von Neumann condition is not sufficient for stability, neither in its 



original form ( 239 ) nor in its strong one ( 241 ) , unless, for example, Q can be uniformly diagonalized. 



This means that there exists a matrix T such that 

A = T- 1 QT = diag(g ,...,g w ) 
is diagonal and the condition number of T with respect to the same norm, 

*(!?):= IITWIT- 1 !!,. 
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is bounded 

«d(T) < K d 

for some constant K d independent of resolution (an example is that one of Q being normal, QQ* = 

Q*Q). In that case 

v fe = TA fc T -i f 

and 

||v fe || d < K (T)max|g 4 | fc ||f|| d < K d e a - kAt \\{\\ d = K d e a ^\\{\\ d . 

i 

Next, we discuss two examples where the von Neumann condition is satisfied but the resulting 
scheme is unconditionally unstable. The first one is for a well posed underlying continuum problem 
and the second one for an ill posed one. 

Example 38. An unstable discretization which satisfies the von Neumann condition for a trivially 
well posed problem |222j . 

Consider the following system on a periodic domain with periodic initial data 

n ( M i 

Ut = , u = 



discretized as 



At V 



1 \ ^k 



Ax " : Utfv* (245) 



with Dq given by Equation ( 233 1 . The Fourier transform of the amplification matrix and its k-th 
power are 

» _ / 1 A sin 2 (a; Ax) \ ~ k _ ( 1 fcAsin 2 (wAx) 

^ ^{o 1 J ' M ~ ^ 1 

The von Neumann condition is satisfied, since the eigenvalues are 1. However, the discretization 
is unstable for any value of A > 0. For the unit vector e = (0, 1) , for instance, we have 



|Q fc e| = yi + (/cA) 2 sin 4 (wAx) , 

which grows without bound as k is increased for sin (loAx) =/= 0. 

The von-Neumann condition is clearly not sufficient for stability in this example because the 
amplification matrix not only cannot be uniformly diagonalized, but it cannot be diagonalized at 



all because of the Jordan block structure in ( 245 ) 



Example 39. Ill posed problems are unconditionally unstable, even if they satisfy the von Neu- 
mann condition. The following example is drawn from [104] . 
Consider the periodic Cauchy problem 

u t - Au x , (246) 

where u = (ui, U2) 1 ' , A is a 2 x 2 constant matrix, and the following discretization. The right hand 
side of the equation is approximated by a second order centered derivative plus higher (third) order 



numerical dissipation (see Section 8.5 1 



Au x -> AD v - eI(&x) A D z + D±v , 

where / is the 2x2 identity matrix, e > an arbitrary parameter regulating the strength of the 
numerical dissipation and D+,D_ are first order forward and backward approximations of d/dx, 

D +Vj :=^-^, D _ Vi ^^LA. (247) 
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The resulting system of ordinary differential equations is marched in time (method of lines, dis- 
cussed in the next section) through an explicit method: the iterated Crank-Nicholson (ICN) one 



with an arbitrary but fixed number of iterations p (see Example 37 ) 
If the matrix A is diagonalizable, as in the scalar case of Example 



37 the resulting discretization 



is numerically stable for A < 2/o and p = 2,3, 6, 7, 10, 11, . . ., even without dissipation. On the 
other hand, if the system ( |246| ) is weakly hyperbolic, as when the principal part has a Jordan 
block, 

' a 1 



! ' 

one can expect on general grounds that any discretization will be unconditionally unstable. As an 
illustration, this was explicitly shown in Ref. |104j for the above scheme and variations of it. In 
Fourier space the amplification matrix and its k-th power take the form 



c b \ n k — ( ° k ^c k 1 ^ 

c J ' Q ~\ 



Q = ( n „ ) > Q - I n 



with coefficients c, b depending on {a, A, wAi, e} such that for an arbitrary small initial perturbation 
at just one gridpoint, 

ug = (0, 27re) T , v° = (0, 0) T otherwise, 



the solution satisfies 



\v k \\ 2 d > Cfc 5/4 ||v°||2 for some constant C, 



and is therefore unstable regardless of the value of A and e. On the other hand, the von Neumann 
condition \a\ < 1 is satisfied if and only if 

< eA < 1/8. (248) 

Notice that, as expected, the addition of numerical dissipation cannot stabilize the scheme inde- 
pendently of its amount. Furthermore, adding dissipation with a strength parameter e > 1/(8A) 



violates the von Neumann condition ( 248 ) and the growth rate of the numerical instability becomes 
worse. 



7.3 The method of lines 

A convenient approach both from an implementation point of view as well as for analyzing nu- 
merical stability or constructing numerically stable schemes is to decouple spatial and time dis- 
cretizations. That is, one first analyzes stability under some spatial approximation assuming time 
to be continuous (semi-discrete stability) and then finds conditions for time integrators to preserve 
stability in the fully discrete case. 

In general, this method provides only a subclass of numerically stable approximations. However, 
it is a very practical one, since spatial and time stability are analyzed separately and stable semi- 
discrete approximations and appropriate time integrators can then be combined at will, leading to 
modularity in implementations. 

7.3.1 Semi discrete stability 

Consider the approximation 

v t (t) = Lv, t>0 (249) 

v(0) = f (250) 
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for the initial value problem (219, 220). The scheme is semi-discrete stable if the solution to 



Equations (249 250) satisfies the estimate (221). 

In the time-independent case, the solution to (|249 250 1 is 



v(t) = e Lt f 

and stability holds if and only if there are constants K d and a d such that 

||e L '|| d < K d e adt for alii >0. 

The von Neumann condition now states that there exists a constant a d , independent of spatial 
resolution (i.e. the size of the matrix L), such that the eigenvalues ^ of L satisfy 



maxRe(li) < a d . 



(251) 



This is the discrete in space version of the Petrovskii condition, see Lemma [2j As already pointed 
out, it is not always a sufficient condition for stability, unless L can be uniformly diagonalizcd. 
Also, if the lower order terms are dropped from the analysis then 



(Ax)P 



with L independent of Ax, and in order for (251 1 to hold for all Ax (in particular small Ax), 

maxRe(£ ( ) < 0, (252) 

i 

which is a stronger version of the semidiscrete von Neumann condition. 

Semi-discrete stability also follows if L is semi-bounded, that is, there is a constant a d inde- 
pendent of resolution such that (cf . Equation (p~7|) in Theorem flT) 



(v, Lv) d + (Lv, v) d < 2a d ||v|| d for all v 



(253) 



In that case the semi-discrete approximation ( 249 250 ) is numerically stable, as follows immediately 
from the following energy estimate arguments, 



-I 

dr 



d 

dt 



v, v) d = (Lv, v) d + (v, Lv) d < 2a d ||v|| d . 



For a large class of problems which can be shown to be well posed using the energy estimate, one 
can construct semi-bounded operators L by satisfying the discrete counterpart of the properties 



of the differential operator P in Equation (219) that were used to show well posedness. This 



leads to the construction of spatial differential approximations satisfying the Summation By Parts 



property, discussed in Sections 8.3 and 9.4 



7.3.2 Fully discrete stability 



Now we consider explicit time integration for systems of the form ( 249 250 ) with time-independent 



coefficients. That is, if there are N points in space we consider the system of ordinary differential 
equations (ODEs) 

v t = Lv , (254) 

where L is an N x N matrix. 

In the previous subsection we derived necessary conditions for semidiscrete stability of such 



systems. Namely, the von Neumann one in its weak (251) and strong (252) forms. Below we shall 
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derive necessary conditions for fully discrete stability for a large class of time integration methods, 



including Runge-Kutta ones. Upon time discretization, stability analyses of (254) require the 



introduction of the notion of the region of absolute stability of ODE solvers. Part of the subtlety in 
the stability analysis of fully discrete systems is that the size N of the system of ODEs is not fixed; 
instead, it depends on the spatial resolution. However, the obtained necessary conditions for fully 
discrete stability will also turn out to be sufficient when combined with additional assumptions. 
We will also discuss sufficient conditions for fully discrete stability using the energy method. 



Necessary conditions. Recall the von Neumann condition for the semidiscrete system (254 1 



if £i is an eigenvalue of L a necessary condition for semidiscrete stability is [cf. Equation (251 )] 

maxRe(^) <a d . (255) 

i 

for some «d independent of N. 



Suppose now that the system of ODEs (254 1 is evolved in time using a one-step explicit scheme, 

Qv fc , (256) 



r k+l 



and recall (cf. Equation ( 241[ )) that a necessary condition for the stability of the fully discrete 
system (256) under the assumption (240) that Q depends only on the CFL factor is that its 



2^ 



eigenvalues {q{\ satisfy 

max |<7i| < 1 

i 

for all spatial resolutions N. Next, assume that the ODE solver is such that 

Q = R(AiL) , where R is a polynomial in AiL 

and notice that if £, is an eigenvalue of L, then 

r 2 := B.(AUi) 

is an eigenvalue of Q, 

{n} C {q z } , 



(257) 
(258) 



and (257) implies that a necessary condition for fully discrete stability is 

max \rA < 1 . 



(259) 



Definition 13. The region of absolute stability of a one-step time integration scheme R is defined 
as 

S K := {z e C : |R(z)| < 1} . 



The necessary condition ( 259 1 can then be restated as 



Lemma 6 (Fully discrete von Neumann condition for the method of lines.). Consider the semidis- 



crete system \25Jv and a one-step explicit time discretization (256) satisfying the assumptions 
(258). Then, a necessary condition for fully discrete stability is that the spectrum of the scaled 



spatial approximation AiL is contained in the region of absolute stability of the ODE solver R, 

cr(AtL) c S K . 
for all spatial resolutions N. 



3 As we will discuss later, this includes explicit, one-step Runge-Kutta methods. 
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In the absence of lower order terms and under the already assumed conditions (258) the strong 



von Neumann condition (252 1 then implies that S R must overlap the half complex plane {z e 



Re(z) < 0}. In particular, this is guaranteed by locally stable schemes, defined as follows. 

Definition 14. An ODE solver R is said to be locally stable if its region of absolute stability <S R 
contains an open half disc D—(r) := {fi £ C : |/i| < r, Re(/i) < 0} for some r > such that 

D_(r)cS R . 

As usual, the von Neumann condition is not sufficient for numerical stability and we now discuss 
an example, drawn from [261] . showing that the particular version of Lemma H3] is not either. 

Example 40. Consider the following advection problem with boundaries: 



u t = -u x , 

u(t,0) = 0, 

u{0,x) = f(x), 



x €[0,1], <>0, 
x = 0, t > 0, 
a; 6 [0,1], t = 0, 



where / is smooth and compactly supported on [0, 1], and the one-sided forward Euler discretiza- 
tion, cf. Example 33 with injection of the boundary condition (see Section 10 1 



,fc+i 



"i-i 



At 



4 = o, 



for j = 1,2,. 
for j = 0. 



■ N, 



The corresponding semidiscrete scheme can be written in the form 

d 



dt 



Lv 



for (notice that in the following expression the boundary point is excluded and not evolved) 



v = (vi,v 2l ...,v N y 



with L the banded matrix 



L = 



/-I 
1 






-1 

1 






-1 



V o o 



1 



o \ 







-1 J 



followed by integration in time through the Euler method, 



v fc+i = (1 + AtL) 



(260) 



Since L is triangular, its eigenvalues are the elements of the diagonal; namely, {£;} = { — 1/Ax}, 
i.e., there is a single, degenerate eigenvalue q — — 1/Ax. 
The region of local stability of the Euler method is 



S E = {zeC: \l + z\ < 1}, 

which is a closed disk of radius 1 in the complex plane centered at zq 
von Neumann condition then is 

|l + «At|= 1-^ < I 



-1, see Figure [2} The 
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or 



At 

A^ 



< 2. 



On the other hand, if the initial data has compact support, the numerical solution at early times 



is that one of the periodic problem discussed in Example 33 for which the Fourier analysis gave a 
stability condition of 

Ax 



Sufficient conditions. Under additional assumptions, fully discrete stability does follow from 
semi-discrete stability if the time integration is locally stable: 



Theorem 11 (Kreiss-Wu [551]). At 



that 



(i) A consistent semidiscrete approximation to a constant- coefficient, first order initial-boundary 
value problem is stable in the generalized sense (see Definition 12 and the following remarks). 



(ii) The resulting system of ODEs is integrated with a locally stable method of the form \25ffy , 
with stability radius r > 0. 



(Hi) If a € M is such that \a\ < r and 



R(ia) 



— „i<P 



then there is no /3 £ M. such that \/3\ < r , R(if3) — e 1 ^ , and f3 ^ a. 

Then the fully discrete system is numerically stable, also in the generalized sense, under the CFL 
condition 

Ai||L|| d < A for any A < r . 

Remarks 

• Condition (iii) can be shown to hold for any consistent approximation, if r is sufficiently 
small 



Explicit, one-step Runge-Kutta (RK) methods, which will be discussed in Section 7.5 



in particular of the form (258) when applied to linear, time-independent problems. In fact, 



Equation (258), 



consider an arbitrary, consistent, one-step, explicit ODE solver (256) of the form given in 

{AtL) J 



Q = R(AtL) =J2< 



3=0 






with a s =/= 0; 



(261) 



the integer s is referred to as the number of stages. 

Since the exact solution to Equation (254) is v(i) = e* L v(0) and, in particular, 



v(t 



fc+1 ) = e^vfe)=E^^v(i fc ) ) 



3=0 



Equation (261) must agree with the first n terms of the Taylor expansion of e , where 



is the order of the global p*| truncation error of the ODE solver, defined through 



R(AiL) 



„AtL 



O (AtL) 



n+l 



9 This refers to the truncation error at a fixed final time, as opposed to the local one after an iteration. 
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Therefore, we must have 

otj = 1 for < j < n . 

We then see that a scheme of order n needs at least n stages, s > n, and 

' (AiL) J — 'A,T.V 



(262) 



R(AiL) 



E 

3=0 



.;! 



^ (AtLy 



The above expression in particular shows that when s = n (i.e. when the second sum on the 
right hand side is zero) the scheme is unique, with coefficients given by Equation (262 1. In 



particular, for n = 1 = s, such scheme corresponds to the Euler one discussed in the example 



above, see Equation (260) 



• As we will discuss in Section [775] in the nonlinear case it is possible to choose RK methods 
with s = n if and only if n < 5. 

• When s = n, first and second order RK are not locally stable, while third and fourth order 



are. The fifth order Dormand-Prince scheme (also introduced in Section 7.5 1 is also locally 
stable. 



1 - 



o- 



-2- 



-3- 




RK2 

RK3 

-RK4 - 

Euler 




Real 



Figure 2: Regions of absolute stability of RK time integrators with s = n. Second order RK (RK2) 
and the Euler method are not locally stable. 



Using the energy method, fully discrete stability can be shown (resulting in a more restrictive 
CFL limit) for third order Runge-Kutta integration and arbitrary dissipative operators L 274I I397] : 



Theorem 12 (Levermore). Suppose L is dissipative, that is Equation (253) with a d = holds. 
Then, the third order Runge-Kutta approximation v(tk+i) — R3(AiL)v(ife), where 



R 3 (z) = l + z+ — + 



for the semi-discrete problem ( 254 ) * s strongly stable, 

||R 3 (AtL)|| d < 1 



6 ' 
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under the CFL timestep restriction Ai||L|| d < 1. 

Notice that the restriction a^ = is not so severe, since one can always achieve it by replacing 



L with L — a d I. A generalization of Theorem 12 to higher order Runge-Kutta methods does not 
seem to be known. 

7.4 Strict or time-stability 

Consider a well posed problem. For the sake of simplicity, we assume it is a linear initial value one; 
a similar discussion holds for linear initial-boundary value problems. According to Definition [3] 
there are constants K and a such that smooth solutions satisfy 

IK*, Oil < ^e Q '||u(0,-)|| for all* >0. (263) 

Definition 15. For a numerically stable semidiscrete approximation, there are resolution inde- 
pendent constants K d ,a d such that for all initial data 

IK*, Olid < ^de Qd '||w(0, -)||d forallt>0. (264) 

The approximation is called strict or time stable if, for finite differences, 

a d <a + (D{Ax). (265) 

Similar definitions hold in the fully discrete case and/or when the spatial approximation is not a 



finite difference one. Essentially, ( 265 ) attempts to capture the notion that the numerical solution 



should not have, at a fixed resolution, growth in time which is not present at the continuum. The 



problem with the definition, however, is that it is not useful if the estimate (2631 is not sharp 



since neither will be ( 264 ) , and the numerical solution can still exhibit artificial growth. 



Example 41. Consider the problem (drawn from Ref. |270j ) 



u 
u t =u' + - , a < x < b, t > 0, (266) 

x 

u(t,b) = v , x = b, t>0, (267) 

6 

u(0,x) = -, a<x<b, t = 0, (268) 



x 
where b > a > 0, for which the solution is the stationary one 

/ s 1 

U(t,X) = — . 
X 

Defining 

b 

E(t) = (u(t, ■), u(t, 0) with (u,v) — / u(x)v(x)dx 

a 

it follows that 



-E(t) = - - u(a) 2 + 2(u,u/x) < -z+2{u,u/x) < ^ + -(u,u) = ^ + -2E(t). (269) 

at o z tr b z a b^ a 

This energy estimate implies that the spatial norm of u{t) cannot grow faster than e*' a . However, 



in principle this need not be a sharp bound. In fact, Equation (266) is, in disguise, the advection 
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equation (xu) t = (xu) x for which the general solution (xu)(t,x) — F(t + x) does not grow in time. 



Therefore, a numerical scheme for the problem (266 267| 268) whose solutions are allowed to grow 
exponentially at the rate e l l a is strictly stable according to Definition 



15 



but the classification of 

the scheme as such is of not much use if the growth does take place. In order to illustrate this, 
we show the results for two schemes. In the first one, spurious growth takes place although the 
scheme is strictly stable, and in the second case one obtains a strictly stable scheme with respect 
to a sharp energy estimate which does not exhibit such growth. 



If the system ( 266 267 268 ) is approximated by 



v t = Dv + 



(270) 



where D is a finite difference operator satisfying Summation By Parts (SBP) (Section 8.3) and 
the boundary condition is imposed through a projection method (Section |10| , it can be shown -as 
discussed in the next section- that the following semi-discrete estimate holds, at least for analytic 
boundary conditions, with the discrete SBP scalar product (u,v)s defined by Equation (285) in 
the next section, 

d „ , , 1 2 



dt E ^ * V 



-E d (t) , 



E d := (v,v) 



(271) 



Technically, the semi-discrete approximation ( |270[ ) is strictly stable, since the continuum ( |269 ) 
and semi-discrete (271) estimates agree. However, this does not preclude spurious growth, as the 
bounds are not sharp. The left panel of Figure [3] shows results for the -D2-1 SBP operator (see the 
definition in Example 51 below), boundary conditions through an orthogonal projection, and third 



order Runge-Kutta time integration. At any time, the errors do converge to zero with increasing 
resolution as expected since the scheme is numerically stable. However, at any fixed resolution 
there is spurious growth in time. 
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Figure 3: Comparison [270] between numerical solutions to (270) (left, and right labeled as Non- 
strictly stable) and ( |272[ ) (right, labeled as Strictly stable). The left panel shows that the scheme 
is numerically stable but not time-stable. The right panel shows that a conservative scheme is 
time-stable and considerably more accurate. 



On the other hand, discretizing the system as 

1 



i't 



D(xv) 



(272) 



is in general not equivalent to (270) because difference operators do not satisfy the Leibnitz rule 
exactly. In fact, defining 

E d := (xv,xv)s 
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it follows that 



d ~ , - 
Jt E ^ 



a 2 v(a) 2 b 2 v{bf 



(273) 



2 2 

which, being an equality, is as sharp as an estimate can be. 

The right panel of Figure [3] shows a comparison between discretizations (2701 and (272), as 
well as (270) with the addition of numerical dissipation (see Section 8.5), in all cases at the same 
fixed resolution. Even though numerical dissipation does stabilize the spurious growth in time, 
the strictly stable discretization (272) is considerably more accurate. Technically, according to 



Definition 15 the approximation 



270) is also strictly stable, but it is more useful to reserve the 



term to the cases in which the estimate is sharp. The approximation (272), on the other hand, is 



(modulo the flux at boundaries, discussed in Section 10 ) energy preserving or conservative 



In order to construct conservative or time stable semidiscrete schemes one essentially needs to 
write the approximation by grouping terms in such a way that when deriving at the semidiscrete 
level what would be the conservation law at the continuum, the need of using the Leibnitz rule 
is avoided. In addition, the numerical imposition of boundary conditions also plays a role (see 



Section 10 ) 



In many application areas, conservation or time-stability play an important role in the design 
of numerical schemes. That is not so much (at least so far) the case for numerical solutions of Ein- 
stein's equations because in General Relativity there is no gauge-invariant local notion of conserved 
energy unlike many other nonlinear hyperbolic systems (most notably, in Newtonian or special rela- 
tivistic Computational Fluid Dynamics) , see however [388 . In addition, there are no generic sharp 
estimates for the growth of the solution that can guide the design of numerical schemes. However, 
in simpler settings such as fields propagating on some stationary fixed background geometry, there 
is a notion of conserved local energy and accurate conservative schemes are possible. Interestingly, 
in several cases such as Klein-Gordon or Maxwell fields in stationary background spacetimes the 
resulting conservation of the semidiscrete approximations follows regardless of the constraints be- 
ing satisfied (see, for example, |270j h A local conservation law in stationary spacetimes can also 
guide the construction of schemes to guarantee stability in the presence of coordinate singularities 



QM[H3[nil[S02J, as discussed in the Section 7.6 



In addition, there has been work done on variational, symplectic or mimetic integration tech- 
niques for Einstein's equations, which aim at exactly or approximately preserving the discrete 
constraints while solving the discrete evolutions equations. See, for example, |296l 11361 11951 11941 
1751 HP71 13371 1155113351 11701133511335] . 



7.5 Runge— Kutta methods 

In the method of lines approach one ends up effectively integrating a system of ordinary differential 
equations, which we now generically denote by 



d.t 



y(t) = f(t,y). 



The majority of approaches in numerical relativity use one-step, multi-stage, explicit Runge-Kutta 
(RK) methods, which take the following form. 

Definition 16. An explicit, s-stage, Runge-Kutta method is of the form 

fcl = f(tn,yn) , 

h = f (t„ + c 2 Ai, y n + Ata 2 ik 1 ) , 

fc 3 = / (t n + c 3 Ai, y n + At (a 31 ki + a 32 k 2 )) , 



107 



fa = f (t n + c s At, y n + At (a sl fa + . . . + a s , s _ife s _i)) , 
Vn+i = Vn + At(bifa + . . . + b s k s ) . 

with bi (1 < I < s), Cj (2 < j < s), and 0^(1 < i < J — 1) real numbers. 

Next we present a few examples, in increasing order of accuracy. The simplest one is a forward 



finite difference scheme [cf. Equation 247 



Example 42. The Euler method (one stage, first order): 

fa = f(t n ,yn) 

2M+1 = Vn + Atfa 

It corresponds to all the coefficients set to zero except b\ = 1 . 
Example 43. Second order, two-stages RK : 

fa = f(t n ,y n ) 

J At At, 
K2 = J \t n + -p-^i 

y n+ i = y n + Atk 2 , 
The non-vanishing coefficients are c 2 = 1/2 = a 2 i, b 2 = 1. 
Example 44. Third order, four-stages RK: 

fa = f(tn,Vn) 

, f At At, 

«2 = J [t n + —,y n + —ki 

k 3 = f(t n + At,y n + Atk 2 ) 
fa = f(t n + At,y n + Atk 3 ) 

k\ 2k 2 fc4 



Vn+l = Vn + At 

6 3 6 

In the previous case the number of stages is larger than the order of the scheme. It turns out that 
it is possible to have a third order RK scheme with three stages. 
Example 45. Third order, three-stages Heun-RK method 

fa = f(t n ,y n ) 
k 2 = / I t n + —-,y n + —-fa 
. ( 2 At 2 At , 

fa = J I *n + -g-j Vn + ~^rfa 
[fa ifa 

y«+i = Vn + At I — + — 

It is also possible to find a fourth order, four-stages RK scheme. Actually, there are multiple ones, 
with one and two-dimensional free parameters. A popular choice is 



108 



Example 46. The standard fourth order, four-stages RK method: 

ki = f(t n ,y n ) 

,( At At, 

K2 = ./I t n + — , y n + — fci 

At At, 



h = f[t r< 

fc 4 = 
Vn+l = y n + At 

At this point it is clear that since the RK methods have the same structure, it is not very efficient 



2 '"" 2 

f(t n + At,y„ + Atk 3 ) 

'hi 2k 2 2k 3 
6 6 6 



ki 



to explicitly write down all the stages and the final step. Butcher's tables are a common way to 
represent them, they have the following structure: 



Table 1: The structure of a Butcher table. 


C2 021 

C3 a-31 a-32 



a s i a S 2 



a>s,s-i 



h 



b s -i 



Table [2] shows representations of Examples J42j |43j |44j [45j while Table [3] shows the classical 
fourth order Runge-Kutta method of Example |46| 

Table 2: From left to right: The Euler method, second and third order RK, third order Heun. 









1/2 












1/2 


1/2 






1 





1 




1 





1 






1/6 


2/3 


1/6 



1/3 

2/3 



1/3 




2/3 




3/4 



The above examples explicitly show that up to, and including, fourth order accuracy there are 
Runge-Kutta methods of order p and s stages with s = p. It is interesting that even though the 
first RK methods date back to the end of the nineteenth century, the question of whether there 
are higher order (than four) RK methods remained open until the following result was shown by 
Butcher in 1963 [90]: s = p cannot be achieved anymore starting with fifth order accurate schemes, 
and there are a number of barriers. 



Table 3: The standard fourth-order Runge-Kutta method. 





1/2 

1/2 

1 


1/2 




1/2 
1 




1/6 


1/3 1/3 1/6 
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Theorem 13. For p > 5 there are no Runge-Kutta methods with s = p stages. 

There are however, fifth and sixth order RK methods with six and seven stages, respectively. 
Butcher in 1965 [5T] and 1985 [52] respectively showed the next following barriers. 

Theorem 14. For p > 7 there are no Runge-Kutta methods with s = p + 1 stages. 

Theorem 15. For p > 8 there are no Runge-Kutta methods with s = p + 2 stages. 

Seventh and eighth order methods with s = 9 and s = 11 stages, respectively, have been 
constructed, as well as a tenth order one with s = 17 stages. 

7.5.1 Embedded methods 

In practice many approaches in numerical relativity use an adaptive timestep method. One way 
of doing so is to evolve the system of equations two steps with timestep At and one with 2At. 
The difference in both solutions at t + 2 At can be used, along with Richardson extrapolation, to 
estimate the new timestep needed to achieve any given tolerance error. 

In more detail: if we call y 2 the solution at t + 2 At evolved from time t in two steps of size 
At, and yi the solution at the same time advanced from time t in one step of size 2At, then the 
following holds 

y(t + 2At)-y 2 = y ^+0((Atr 1 

where y denotes the exact solution. Therefore, the term 

V2 - 2/i 
2P- 1 

can be used as an estimate of the error and choose the next timestep. 

Embedded methods also compute two solutions and use their difference to estimate the error 
and adapt the timestep. However, this is done by reusing the stages. Two Runge-Kutta methods, 
of order p and p' (in most cases - but not always - p' = p-\- 1 or p = p' + 1) are constructed, which 
share the intermediate function values so that there is no overhead cost. Therefore their Butcher 
table looks as follows: 

Table 4: The structure of embedded methods. 




C2 


021 








C3 


031 


032 






Cs 


a s i 


a S 2 


<2s,s-l 






h 


b 2 .. 


b s -i 


b s 




K 


b' 2 .. 


■ V.-i 


K 



Embedded methods are denoted by p(p'), where p is the order of the scheme which advances 
the solution. For example, a 5(4) method would be of fifth order, with a fourth order scheme which 
shares its function calls used to estimate the error. 

Table [5] shows the coefficients for a popular embedded method, the seven stages Dormand- 
Prince 5(4) |143j . Dormand-Prince methods are embedded methods which minimize a quantifi- 
cation of the truncation error for the highest order component, which is the one used for the 
evolution. 
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Table 5: The 


5(4) Dormand-Prince method. 

















1 


1 










5 


5 










3 


3 


9 








10 


40 


40 








4 


44 


56 


32 






5 


45 


15 


9 






8 


19372 


25360 


64448 


212 




9 


6561 


2187 


6561 


729 






9017 


355 


46732 


49 5103 




1 


3168 


~~33~ 


5247 


176 18656 






35 




500 


125 2187 11 




1 


384 





1113 


192 6784 84 






35 




500 


125 2187 11 




Vi 


384 





1113 


192 6784 84 





i 


5179 





7571 


393 92097 187 


1 


Vi 


57600 


16695 


640 339200 2100 


40 



7.6 Remarks 

The classical reference for the stability theory of finite-difference for time dependent problems 
is |349| . A modern account of stability theory for initial-boundary value discretizations is 222 . 
Ref. 221 includes a discussion of some of the main stability definitions and results, with emphasis 
on multiple aspects of high order methods, and 403, 404 many examples at a more introductory 
level. We have omitted discussing the discrete version of the Laplace theory for IBVP, developed 
by Gustafsson, Kreiss and Sundstrom (known as GKS theory or GKS stability) [223] since it has 
been used very little (if at all) in numerical relativity, where most stability analyses instead rely 
on the energy method. 

The simplest stability analysis is that one of a periodic, constant coefficient test problem. An 
eigenvalue analysis can include boundary conditions and is typically used as a rule of thumb for 
CFL limits or to remove some instabilities. The eigenvalues are usually numerically computed for 
a number of different resolutions. See |167l 1171) for some examples within numerical relativity. 

Our discussion of Runge-Kutta methods follows [53] and [224] . which we refer to, along with 
[225J, for the rich area of methods for solving ordinary differential equations, in particular Rungc- 
Kutta ones. We have only mentioned (one-step) explicit methods, which are the ones used the 
most in numerical relativity, but they are certainly not the only ones. For example, stiff problems 
in general require implicit integrations. Ref. 265, 315, 266 explored implicit-explicit (IMEX) time 
integration schemes in numerical relativity. Among many of the topics that we have not included 
is that one of dense output. This refers to methods which allow the evaluation of an approximation 
to the numerical solution at any time between two consecutive timesteps, at an order comparable 
or close to that one of the integration scheme, and at low computational cost. 
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8 Spatial Approximations: Finite Differences 

As mentioned in the previous section, a general stability theory (referred to as GKS) for initial- 
boundary value problems was developed by Gustafsson, Kreiss and Sundstrbm |223j , and a simpler 
approach, when applicable, is the energy method. The latter is particularly and considerably sim- 
pler than a GKS analysis for complicated systems such as Einstein's field equations, high order 
schemes, spectral methods, and/or complex geometries. The Einstein vacuum equations can be 
written in linearly degenerate form and are therefore expected to be free of physical shocks and 
ideally suited for methods which exploit the smoothness of the solution to achieve fast convergence, 
such as high order finite difference and spectral methods. In addition, an increasing number of 
approaches in numerical relativity use some kind of multi-domain or grid structure approach (see 



Section 11). If the problem at the continuum admits an energy estimate, which as discussed in 
Sections |4] and [6] is usually the case in General Relativity, then at least in the linearized case there 
are multi-domain schemes for which numerical stability can relatively easily be established for a 
large class of linear symmetric hyperbolic problems and maximal dissipativc boundary conditions 
through the energy method. In this section we discuss spatial finite difference (FD) approximations 
of arbitrary high order for which the energy method can be applied, and in Section [9] boundary 
closures for them. Wc start by reviewing polynomial interpolation, followed by the systematic 
construction of finite difference approximations of arbitrary high order and stencils through in- 
terpolation. Next, we introduce the concept of operators satisfying Summation By Parts, present 
a semidiscrete stability analysis, and the construction of high order operators optimized in terms 
of minimizing their boundary truncation error and their associated timestep (CFL) limits (more 
specifically, their spectral radius). Finally, we discuss numerical dissipation, with emphasis on the 
region near boundaries or grid interfaces. 

8.1 Polynomial interpolation 

Although interpolation is not strictly a finite differencing topic, we briefly present it here because 
it is used below and in Section [91 when discussing spectral methods. 

Given a set of (N + 1) distinct points {xj}^ =0 (sometimes referred to as nodal points or nodes) 
and arbitrary associated function values f(xj), the interpolation problem amounts to finding (in 
this case) a polynomial I^{f\{x) of degree less than or equal to N such that I^[f](xj) — f(xj) for 
j=0,l,2,...,7V. 

It can be shown that there is one and only one such polynomial. Existence can be shown by 
explicit construction: suppose one had 

N 

i N [f]{x) = Y J f^)f\ x )' ( 274 ) 

3=0 

where, for each j = 0, 1, . . . , N, I- ' (x) is a polynomial of degree less than or equal to N such that 

lf\xi) = 6ij fori = 0,1,..., TV. (275) 



Then Ijv [/] (aO as given by Equation (274) would interpolate f{x) at the (N+l) nodal points {xi}. 
The Lagrange polynomials, defined as 

if\x)=l n (*-**)][ n (*;-**)) > ( 276 ) 

\k=0,k& J \k=0,k^j J 
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indeed do satisfy Equation ( 275 ) . Uniqueness of the interpolant can be shown by using the property 
that polynomials of order TV can have at most TV roots, applied to the difference between any two 
interpolants. 

Defining the interpolation error by 

E N (x) = \f(x)-l N [f}(x)\ 

and assuming that / is differentiable enough, it can be seen that E^ satisfies 

En{x) = -j^^\f {N+1) {^)^N + x{x)\ , (277) 

where u>n+i(x) := Y[-j=o( x ~ x j) * s called the nodal polynomial of degree (TV + 1), and £ x is 
in the smallest interval I x containing {a;o,a;i, . . . ,xn} and x. In other words, if we assume the 
ordering xq < X\ < . . . < xn, then x can actually be outside [aJoj^jv]- F° r example, if a; < Xo, 
then T x — [x,xn\- Sometimes, approximating f(x) by Tfj[f](x) when x (£ [xq 7 xn] is called 
extrapolation, and interpolation only if a; € [#0)#iv]i even though an interpolating polynomial is 
used as approximation. 

8.2 Finite differences through interpolation 

Finite difference (FD) approximations of a p-th derivative are local linear combinations of function 
values at node points, 

D {p) f{x) := -l-^/foJoito (278) 

= £ p f(x) + 0((Ax) r ), 

where the nodes {xi} do not need to include x (that is, the grid can be staggered). In the finite 
difference case the number of nodes is usually kept fixed as resolution is changed, resulting in a 
fixed convergence order r. This is in contrast to discrete spectral collocation methods, discussed in 
Section [91 which can be seen as approximation through global polynomial interpolation at special 
nodal points. 

A way of systematically constructing FD operators with an arbitrary distribution of nodes, any 
desired convergence order, and which are centered, one-sided or partially off-centered in any way 
is through interpolation. A local polynomial interpolant is used to approximate the function /, 
and the FD approximation is defined as the exact derivative of the interpolant. That is, 

N 



and, for instance for a first derivative, 



i=0 



, N , 

Df(x) := -^I[.f]{x) = ]T f{ Xi )^-tf\x) . (279) 

8=0 



Notice that the expression (279) does have the form of Equation (278), where {xi\ are the nodal 
points of the interpolant and 

ai (x) = (Ax)^\x). 
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The truncation error for the FD approximation (279) to the first derivative can be estimated 

fo 

d 



by differentiating the error formula for the interpolant, Equation (277) 

d 



E r (x) 



dx 



M 



i 



to^W - ta 



(^' (,t ">«« 



d 



(iV+1)! \dx 



f {N+1) (C*)) «(*) + y^f N+1) &)£+>to- ( 28 °) 



The derivative of the first term in Equation ( 280 ) is more complicated to estimate than the second 



one without analyzing the details of the dependence of £ on x. But if we restrict x to be a nodal 
point x = Xk, then w(xfc) = and the previous equation simplifies to 



1 N 

i?/ ' (a;fc)= nvTil! /(Ar+1)( ^ ) II (**-**)• 



(281) 



i=0,i^fc 



Notice that the Equation (281 1 implies that the resulting finite difference approximation has design 



convergence order r — N. For example, assume that the nodes are equally spaced, as is many times 
the case (the fact that the convergence rate is N does not rely on such assumption). Then, 



Ef>(x k ) 



(Ax; 

(JV + l) 



.N N 

w/ (jv+i) fej n ( fc -*)- 



(282) 



i=0S^k 



Example 47. A first order, one sided, finite difference approximation for d/dx. 
We construct a first degree interpolant using two nodal points {xq,xi}, 



l[f]{x) = f{x )l£\x) + f{x l )^\x), 



with 



Then 



e£\x) 



Df{x) 



X — X\ 

x - x\ 

d 



Af]i x ) 



i[ 1 \x)= X - X0 
xi - x 

f(xi) - f(x ) 



(283) 



dx X\ — xq 

If we evaluate x at Xq or x\ we obtain the standard first order forward and backward finite difference 



approximations D + and D-, respectively [cf. Equations (247)]. From Equation (282) we recover 
the known first order convergence for these approximations, r = 1, which can also be obtained 



through a Taylor expansion in Ax of Equation ( 283 1 



Example 48. A second order, centered, finite difference approximation for d/dx. 
Now we construct a second degree interpolant using three nodal points {xq,Xi,X2}, 



Af]{x) - f(x )£$\x) + /(*i)4 a) (a0 + /(x 2 )4 2) (x) , 



with 



6 2 \x)= ( x ~ x i)( x ~ x *) 



(,T -Xl)(x Q -X 2 ) ' 



tf-Vfx)- ( X ~ X 0)( X - X 2) 



(xi -x )(xi -x 2 ) ' 



4 2) (x) = 



(x - x )(x - Xi) 
(x 2 ~x )(x 2 -xi) 



If we assume the points to be equally spaced, x 2 — X\ = Ax = x\ — Xo, and evaluate the derivative 
at the center one, x = Xi, we obtain 

ntr \ d T\*u \ f( x z)~f( x o) 
Df(xi):=- l - x l{f}(xi) = ^ 
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the standard, second order, centered finite difference operator D [cf. Equation (233)]. 

One can proceed in this way to systematically construct any finite difference approximation to 
any derivative with any desired convergence order and distribution of nodal points. The result for 
centered differences approximations to d/dx with even accuracy order r at equally spaced nodes 
can be written in terms of Do, D + , D_ as follows |222) . 

r/2-l 

D r = D J2 (-!)"«« ((A») 2 D + D_)" , (284) 

!/=0 

with 

c^-i forz/=l,2,...,(r/2-l). 



4^ + 2 

Example 49. The fourth, sixth, eighth and tenth order centered FD approximations to d/dx 
are: 

(Ax) 2 
6 



(Ax 



A 



2 r.2 



£> 6 = D i + D ^-^D' + D_, 



(Ax) 6 
D 8 = D 6 -D ^ '—D^D 



30 

6 

3 n 3 



D 1Q = D 8 + Dn { -^DiD 4 



140 

Ax) 6 

630 



8.3 Summation by parts 

Since numerical stability is by definition the discrete counterpart of well posedness, one way to 
come up with schemes which are by construction numerically stable is by designing them so that 
they satisfy the same properties used at the continuum when showing well posedness through an 
energy estimate. As discussed in Section [3] one such property is integration by parts, which leads 
to its numerical counterpart: Summation by Parts [258, 259 . 

Consider a discrete grid consisting of points {xi}fL and uniform spacing Ax on some, possibly 
unbounded, domain [a,b]. 

Definition 17. A difference operator D approximating d/dx is said to satisfy Summation By 
Parts (SBP) on a given domain with respect to a positive definite scalar product S, 



S= A 



x o-i 



(u,v)s ■— Ax 2J UiVjaij , (285) 

i,j=0 

if the property 

(u, Dv)t, + (Du, u) E = u(b)v(b) - u(a)v(a) (286) 

holds for all grid functions u and v. 



115 



This is the discrete counterpart of integration by parts for the -4- operator, 



dx dx 



(f,9) : = / f{x)g{x)dx. 



for all /, g and 



Similar definitions for SBP can be introduced for higher dimensional domains. 

If the interval is infinite, say (—00,6) or (—00,00), certain fall-off conditions are required and 



Equation (286) replaced by dropping the corresponding boundary term(s). 



Example 50. Standard centered differences as defined by Equation ( 284 ) in the domain (—00, 00) 
or for periodic domains and functions satisfy SBP with respect to the trivial scalar product (cry = 

(u,D v)s + (D u,v)j; = 0, (u,v)s ■= Ax^mvi . 



The scalar product or associated norm are said to be diagonal if 
that is, if S is diagonal. It is called restricted full if 

a ibj = a ibih^ibj ' 

where % denote boundary point indices, % = or if, = N: 

\ 



(287) 



(288) 



S = Ax 



( ctoo 




V 



(Jnn ) 




In the case of bounded, non-periodic domains one possibility is to use centered differences and the 
trivial scalar product in the interior and modify both of them at and near boundaries. That is, 
the scalar product has the form 



(289) 



with Si and S r blocks of size independent of Ax. 

Accuracy and Efficiency. As mentioned, in the absence of boundaries standard centered finite 
differences (which have even order of accuracy 2p) satisfy SBP with respect to the trivial (S = Axl) 
scalar product. In their presence the operators can be modified at and near boundaries so as to 
satisfy SBP, examples are given below. It can be seen that the accuracy at those points drops 
to p in the diagonal case and to 2p — I in the restricted full one. Therefore the latter is more 
desirable from an accuracy perspective, but less so from a stability one. Depending on the system, 
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numerical dissipation might be enough to stabilize the discretization in the restricted foil case. 
This is discussed below in Section 18.51 

When constructing SBP operators the discrete scalar product cannot be arbitrarily fixed and 



afterward the difference operator solved for so that it satisfies the SBP property ( 286 1 -in general 



this leads to no solutions. The coefficients of X and those of D have to be simultaneously solved 
for. The resulting systems of equations lead to SBP operators being in general not unique, with 
increasing freedom with the accuracy order. In the diagonal case the resulting norm is automatically 
positive definite but not so in the full-restricted case. 

We label the operators by their order of accuracy in the interior and near boundary points. For 
diagonal norms and restricted full ones this would be Di v _ v and Z?2p-(p-i): respectively. 

Example 51. D 2 -i: For the simplest case, p = 1, the SBP operator and scalar product are 
unique: 

(— U +Ui) 

(u i+ i - m-i) 



D2-lUi 



1 

Ax 



(u,. 



2 

U n -l) 



for i = , 

for i = 1 . . 

for i = N . 



(N-l). 



with coo = &nn = 1/2 and a%j = 6y otherwise. That is, the SBP scalar product (2851 is the 
simple trapezoidal rule for integration. 

The operator Z?4_2 and its associated scalar product are also unique in the diagonal norm case: 



Example 52. D 4 _ 2 : 



D 4 - 2 u t 



Ax 



24 



17 

1 

7, u o 



u 



59 
34 



"l 



17 



U2 



34 



"3 



43 
3 



MO 



2 
59 

86 

59 



U 2 



«1 



59 

86 
32 



"3 



43 
4 



«4 



98 U0 " 98 U2 + 49 M4 ~ 49 U5 



12 



Ui-2 



3 U «-i + 3 Wi +! 



12 



3 59 32 

98 UJV " 9& UN - 2 + 49^ 

4 59 59 
-u N - -«„_! + -u N ^ 

1 1 

- [ _ 2 Ujv + o UN ~ 2 



Ui+2 

4 

49 

4 



U N -5 



43 UN - 4 



24 



59 



Um H M/V-l UM-2 UlV-3 

17 34 17 34 A 



for i = , 

for i = 1 , 
for i = 2 , 
for i = 3 , 

for i = 4 . 



(AT -4). 



for i = N - 3 
for i = N - 2 
for i = TV - 1 
for i = N . 



with scalar product 



... .17 59 43 49 i , 
2j = Ax diag < — , — , — , — ,1,1,. 
' 48' 48' 48' 48' ' ' 



1,1, 



49 43 59 17 

48'48'48'48 
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On the other hand, the operators D e _ 3 , Ds-4, -D10-5 have one, three and ten free parameters, 
respectively. Up to D10-5 their associated scalar products are unique, while for D10-5 one of the 
free parameters enters in E. For the full-restricted case, Z?4_3, £>6-5, -Ds-7 have three, four and 
five free parameters, respectively, all of which appear in the corresponding scalar products. 

A possibility [384 is to use the non-uniqueness of SBP operators to minimize the boundary 
stencil size s. If the difference operator in the interior is a standard centered difference with 
accuracy order 2p then there are b points at and near each boundary where the accuracy is of 
order q (with q = p in the diagonal case and q = 2p — 1 in the full restricted one). The integer b 
can be referred to as the boundary width. The boundary stencil size s is the number of gridpoints 
that the difference operator uses to evaluate its approximation at those b boundary points. 

However, minimizing such size, as well as any naive or arbitrary choice of the free parameters, 
easily leads to a large spectral radius and as a consequence restrictive CFL (Courant-Friedrich- 
Lewy, see Section [7]) limit in the case of explicit evolutions. Sometimes it also leads to rather large 
boundary truncation errors. Thus, an alternative is to numerically compute the spectral radius for 
these multi-parameter families of SBP operators and find in each case the parameter choice that 



leads to a minimum |387[ I273J . It turns out that in this way the order of accuracy can be increased 
from the very low one of D2-1 to higher order ones such as -D10-5 or -Ds-7 with a very small change 



in the CFL limit. It involves some work, but since the SBP property (2861 is independent of the 
system of equations one wants to solve, it only needs to be done once. In the full-restricted case, 
when marching through parameter space and minimizing the spectral radius, this minimization 
has to be constrained with the condition that the resulting norm is actually positive definite. 

The non-uniqueness of high order SBP operators can be further used to minimize a combination 
of the average of the boundary truncation error (ABTE), defined below, without a significant 
increase in the spectral radius. For definiteness consider a left boundary. If a Taylor expansion of 
the finite difference operator is written as 

DuL. = — L. +c l (Ax) q - — —L. for i = 0...b 



then 

ABTE 




1/2 



Table [6] illustrates the results of this optimization procedure for the -D10-5 operator, see |138j for 
more details. 

The coefficients for the SBP operators 

D2-I, £>4-2> £>4-3> ^6-3) "6-57-^8-47 D s _j, Z?io-5 7 

and in particular for their optimized versions are available, along with their associated dissipation 



operators described below in Section 8.5 from the arXiv in |138j and also as complete source code 
from the Einstein Toolkit [T]. 
Remarks: 

• The requirement of uniform spacing is not an actual restriction, since a coordinate transfor- 
mation can always be used so that the computational grid is uniformly spaced even though 
the physical distance between gridpoints varies. In fact, this is routinely done in the context 



of multiple-domains or curvilinear coordinates (see Section 11) 



• In that case, though, stability needs to be guaranteed for systems with variable coefficients, 
since they appear due to the coordinate transformation(s) even if the original system had 
constant coefficients. This has relevance in terms of the distinction between diagonal and 
block-diagonal SBP norms, as mentioned below. 
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Table 6: Comparison, for the -D10-5 operator, of both the spectral radius and average boundary 
truncation error (ABTE) when minimizing the bandwidth or a combination of the spectral radius 
and ABTE. For comparison, the spectral radius and ABTE for the lowest accuracy operator, D1-2, 
(which is unique) are 1.414 and 0.25, respectively. Note: the ABTE, as defined, is larger for this 
operator but its convergence rate is faster. 



Operator 


Min. bandwidth 


Min. ABTE and spectral radius 


Spectral radius 
ABTE 


995.9 
20.534 


2.240 
0.7661 



• 



• 



A similar concept of SBP holds for discrete expansions into Legendre polynomials using 
Gauss-type quadratures, as discussed in Section [9. 4| 

The definition of SBP depends only on the computational domain, not on the system of 
equations being solved. This allows the construction of optimized SBP operators once and 
for all. 

• Difference operators satisfying SBP, which are genuinely multi-dimensional, can be explicitly 
constructed (see, for example, Refs. [100| [99]). However, they become rather complicated 
even for simple geometries as higher order accuracy is sought. An easier approach, for the 
case in which the domain is the cross product of one dimensional ones (say, topologically 
a cube in three dimensions), which is usually the case in many domain-decompositions for 
complex geometries (Section[ll]) is to simply apply a one-dimensional operator satisfying SBP 
in each direction, and this is the approach that we will discuss from hereon. The question 
then is whether SBP holds in several dimensions; the answer is affirmative in the case of 
diagonal norms but not necessarily otherwise. 

8.4 Stability 

As usual, the addition of lower order, undifferentiated terms to the right-hand side of Equa- 



tion (290) still gives an energy estimate at the semidiscrete level, modulo boundary terms. 



If a simple symmetric hyperbolic system with constant coefficients in one dimension of the form 

ut = Au x (290) 

with A = A T symmetric, is discretized in space by approximating the space derivative with an op- 
erator satisfying SBP, a semidiscrete energy estimate can be derived, modulo boundary conditions 



(discussed in Section 10 1. For this, we define 

E d (t) := (u,u)s. 

Taking a time derivative, using the symmetry of A, the fact that D and A commute because the 
latter has constant coefficients, and the SBP property, 

— - = (— u, u)x + (u, — «)s = (DAu, u) s + (Au, Du)^ = [{Aufu} \ . (291) 

Therefore, modulo the boundary terms, an energy estimate and semidiscrete stability follow. 



When considering variable coefficients, Equation (291) becomes 



d ^=[{Au) T u] h a -{u,[D,A}u)* 
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and the commutator between A and D needs to be uniformly bounded for all resolutions in order 
to obtain an energy estimate. 

Estimates for the term involving the commutator [D, A] have been given in Refs. [309, 311, 310, 
1395] . For this, we first notice that the SBP property of D with respect to X and the symmetry of 
A imply that the operator B := [D, A] is symmetric with respect to X, 

(u,[D,A}v)v = ([D,A]u,v)j: 

for all grid functions u and v. Therefore, its norm is equal to its spectral radius and we have the 
estimate 

(u,[D,A]u)v<p([D,A])\\u\\l (292) 

for all grid functions u. Hence, the problem is reduced to finding an upper bound for the spectral 
radius of [-D, A], which is independent of the scalar product X. 

Next, in order to find such an upper bound, we write the finite difference operator as 

1 N 
{Du) 3 = —Y,d 3k u k , j =0,1,... TV, (293) 

k=0 

where the dj^s are the coefficient of a banded matrix, that is, there exists b > which is indepen- 
dent of N such that djk = for \k — j\ > b. Then, we have 

Bus = [D, A] Uj = J2 d ^ A{Xk \ A{Xj) u ^ J=0,h---N, 

k=0 

from which it follows, under the assumption that A is continuously differentiable and that its deriva- 

N 

tive is bounded, \Buj\ < |^xU J2 \ k ~ j\\djk\\uk\, 3 = 0, 1, ...N, where {A^ := sup |Ab(o;)|. 

k=0 a<x<b 

Now we can easily estimate the spectral radius of B 7 based on the simple observation that 

p(B)<\\B\\, ||B||:=supM 

u/0 \\U\\ 

N 

for any norm || • || on the space of grid functions u. Choosing the 1-norm ||u|| := ^ \v,j\ we find 

j=0 

N N I N 

\\ Bu \\ =^2\ Bu j\ < \Ax\oo ^2 \ k - 3\\djk\\u k \ < \Ax\oo jnax y^ |fc- j\\d jk 

3=0 j,k=0 \ ~ ''"' j=0 

from which it follows that 

N 

p([D,A]) < CilAa-loo, C x := _max ^|A;-j||djfc|. 

'~°'"" 3=0 

The important point to notice here is that for each fixed k, the sum in the expression for C\ 
involves at most 26 + 1 non- vanishing terms, since djk = for \k — j\ > b. For the SBP operators 
and scalar products used in practice, namely those for which the latter has the structure given by 




Equation ( 289 ) , C\ can be bounded by a constant which is independent of resolution. Since the 
spectral radius of B is equal to the one of its transposed, we may interchange the roles of j and k 
in the definition of the constant C\, and with these observations we arrive at the following result: 



120 



Lemma 7 (Discrete commutator estimate). Consider a finite difference operator D of the form { 293) 



which satisfies the SBP property with respect to a scalar product S, and let A = A T be symmetric. 
Then, the following commutator estimate holds: 



\(u,[d,a}u)^\<c\a x u\u\\1 

for all grid functions u and resolutions N , where C := min{Ci, C2} with 



N 



N 



C-i :— max > \k — ?||<i,,J, C? := max > \k — j\\d 



jk\ 



(294) 



(295) 



3=0 



fe=0 



Remarks 



• A key ingredient used above to uniformly bound the norm of [D, A] is that the SBP scalar 
products used in practice have the form (289). In those cases, both the boundary width 



and the boundary stencil size (defined below Example 52 ) associated with the corresponding 



difference operators are independent of N. Therefore, the constants C\ and C2 can also be 
bounded independently of N. 



• For the -D2-1 operator defined in Example 51 for instance, Equation (2951 gives C\ = 3/2, 
C2 = 1, and we obtain the optimal estimate corresponding to the one in the continuum limit, 



(u,[ Tx ,A]u) 



<\A X \ 



• For the D^_ 2 operator defined in Example 52 in turn, Equation (295 1 gives C\ = 1770/731 ~ 
2.421 and C 2 = 62/27 « 2.296 

• For spectral methods the constants C\ and C2 typically grow with N as the coefficients 
djk do not form a banded matrix anymore. This leads to difficulties when estimating the 
commutator, see Ref. |395) for a discussion on this point. 



A straightforward energy estimate shows that the skew-symmetric discretization 

u t = -(AD + DA)u--A x u, 



leads to (with appropriate imposition of boundary conditions, discussed in Section 10 ) strict 
stability. 

• Summation by Parts by itself is not enough to obtain an energy estimate since the boundary 
conditions yet still need to be imposed, and in a way such that the boundary terms in the 
estimate after SBP are under control. This is the topic of Section [TO] 



8.5 Numerical dissipation 

The use of numerical dissipation consistently with the underlying system of equations is a standard 
way of filtering unresolved modes, stabilizing a scheme, or both, without spoiling the convergence 
order of the scheme. As an example of unresolved modes, for centered differences, the mode with 
highest frequency for any given resolution does not propagate at all, while the semidiscrete group 
velocity with highest frequency is exactly in the opposite direction to the one at the continuum. In 
addition, the speed increases with the order of the scheme. See, for example, Ref. [273] for more 
details. 
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Some schemes, such as those with upwind finite differences, are intrinsically dissipative, with a 
fixed "amount" of dissipation for a given resolution. Another approach is to add to the discretiza- 
tion a dissipative operator Qd with a tunable strength factor e > 0, 

u =(•••) -* u =(•■•) + eQdU . 

The operator Qd is a higher order derivative compared to the principal part of the equation, 
mimicking dissipative physical systems and/or parabolic equations but in such a way that \\Qd\\ —* 
as Ax — > 0. Furthermore, Qd is usually chosen so that Qd scales with the gridspacing as the 
highest finite difference approximation (so that the amplification factor depends only on At/ Ax). 
For example, for first order in space systems finite differences scale as 

D ~ -L (296) 

Ax 

and Qd is usually chosen to scale in the same way. More precisely, in the absence of boundaries 
the standard way to add numerical dissipation to a first order in space system is (Kreiss-Oliger) 

Q d = {-ly-^Axf^D^D^ , (297) 

where D^ denotes the application of D r times and D + ,D^ denote forward and backward one- 
sided finite differences, respectively. Thus Qd scales with the gridspacing as (As) -1 , like Equa- 



tion (296). If the accuracy order of the scheme is not higher than (2r — 1) in the absence of 



dissipation, it is not decreased by the addition of numerical dissipation of the form ( 297 ) . 

The main property that sometimes allows numerical dissipation to stabilize otherwise unstable 
schemes is when they strictly carry away energy (as in the energy definitions involved in well 



posedness or numerical stability analysis) from the system. For example, the operators (297) are 
semi-negative definite 

(u,Q d u) s <0. (298) 

with respect to the trivial scalar product S = I, under which centered differences satisfy SBP. 



In the presence of boundaries it is standard to simply set the operators ( 297 1 to zero near them 



The result is in general not semi- negative definite as in (298), which can not only not help resolve 



instabilities but also trigger them. Many times this is not the case in practice if the boundary is an 



outer one where the solution is weak, but not for inter-domain boundaries (see Section 10 ). For 
example, for a discretization of the standard wave equation on a multi-domain, curvilinear grid 
setting, using the Dq_§ SBP operator with Kreiss-Oliger dissipation set to zero near interpatch 
boundaries does not lead to stability while the more elaborate construction below docs [138J. 

For SBP-based schemes adding artificial dissipation may lead to an unstable scheme unless the 
dissipation operator is semi- negative under the SBP scalar product. In addition, the dissipation 
operator should ideally be non-vanishing all the way up to the boundary and preserve the accuracy 
of the scheme everywhere (which is more difficult in the SBP case, as it is non-uniform). In 
Ref. 295;, a prescription for operators satisfying both conditions for arbitrary high order SBP 
scalar products, is presented. A compatible dissipation operators is constructed as 

Q d = -(AxfP ^- l D?B p D p , (299) 

where S is the SBP scalar product, D p is a consistent approximation of d p /dx p with minimal 
bandwidth (other choices are presumably possible), and B p is the so-called boundary operator. 
The latter has to be positive semi-definite and its role is to allow boundary points to be treated 
differently from interior points. B p cannot be chosen freely, but has to follow certain restrictions 
(which become somewhat involved in the non-diagonal SBP case) based on preserving the accuracy 
of the schemes near and at boundaries, sec 12951 for more details. 
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8.6 Going further 

Besides the applications already mentioned, high order finite difference operators satisfying SBP 
have been used, for example, in simulations of binary black holes immersed in an external magnetic 
field in the force free approximation |314j , orbiting binary black holes in vacuum [317j , and for the 
metric sector in binary black hole-neutron star evolutions |121j and binary neutron star ones which 
include magneto hydrodynamics [22] . Other works are referred to in combination to multi-domain 
interface numerical methods in Section 1101 

In Ref. [386] the authors present a numerical spectrum stability analysis for block-diagonal 
based SBP operators in the presence of curvilinear coordinates. However, the case of non-diagonal 
SBP norms and the full Einstein equations in multi-domain scenarios for orders higher than four 
in the interior needs further development and analysis. 

Efficient algorithms for efficiently computing the weights for generic finite differences operators 
(though not necessarily satisfying SBP or with proven stability) are given in [162] . 

Discretizing second order time dependent problems without reducing them to first order leads to 
a similar concept of Summation By Parts for operators approximating second derivatives. There is 
steady progress in an effort to combine SBP with penalty interface and outer boundary conditions 
for high order multi-domain simulations of second order in space systems. At present though these 
tools have not yet reached the state of those for first order systems, and they have not been used 
within numerical relativity except for the test case of a 'shifted advection equation' |294j . The 
difficulties appear in the variable coefficient case. We discuss some of these difficulties and the 
state of the art in Section [TUJ In short, unlike the first order case, SBP by itself does not imply an 
energy estimate in the variable coefficient case, even if using diagonal norms, unless the operators 
are built taking into account the PDE as well. In Ref. [292J the authors explicitly constructed 
minimal width diagonal norms SBP difference operators approximating cP/dx 2 up to eighth order 
in the interior, and in Ref. |115j non minimal width operators up to sixth order using full norms 
are given. 

Ref. [427J presents a stability analysis around flat space-time for a family of generalized BSSN- 
type formulations, along with numerical experiments which include binary black hole inspirals. 

SBP operators have also been constructed to deal with coordinate singularities in specific 
systems of equations [102. 363] 1217] . Since a sharp semi-discrete energy estimate is explicitly 
derived in these references, (strict) stability is guaranteed. In particular, in Ref. [217J schemes 
which converge pointwise everywhere -including the origin- are derived for wave equations on 
arbitrary space dimensions decomposed in spherical harmonics. Interestingly enough, popular 
schemes 154 to deal with the singularity at the origin, which had not been explicitly designed 
to satisfy SBP, were found a posteriori to do so at the origin and closed at the outer boundary, 
see [217] for more details. In these cases the SBP operators are tailored to deal with specific 
equations and coordinate singularities, therefore they are problem dependent. For this reason 
their explicit construction has so far been restricted to second and fourth order operators (with 
diagonal scalar products), though the procedure conceptually extends to arbitrary orders. For 
higher order operators, optimization of at least the spectral radius might become necessary to 
address. 

In Ref. |162j the authors use SBP operators to design high order quadratures. The reference 
also includes a detailed description of many properties of SBP operators. 

Superconvergence of some estimates in the case of diagonal SBP operators is discussed in |234) . 
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9 Spatial Approximations: Spectral Methods 

In this section we review some of the theory for spectral spatial approximations, and their applica- 
tions in numerical relativity. These are global representations, which display very fast convergence 
for smooth enough functions. They are therefore very well suited for Einstein's vacuum equations, 
where physical shocks are not expected since they can be written in linearly degenerate form. 

We start in Section |9~l"1 discussing expansions onto orthogonal polynomials which are solutions 
to Sturm-Liouville problems. In those cases it is easy to see that for smooth functions the decay of 
the error in truncated expansions with respect to the number of polynomials is in general faster than 



any power law, which is usually referred to as spectral convergence. Next, in Section 9.2 we discuss 
a few properties of general orthogonal polynomials; most important that they can be generated 
through a three-term recurrence formula. Follows Section |9.3| with a discussion of the most used 
families of polynomials in bounded domains; namely, Legendre and Chebyshev ones, including the 
minmax property of Chebyshev points. Approximating integrals through a global interpolation 
with a careful choice of nodal points makes it possible to maximize the degree with respect to which 
they are exact for polynomials (Gauss quadratures). When applied to compute discrete truncated 
expansions, they lead to two remarkable features. One of them is Summation by Parts (SBP) 
for Legendre polynomials, in analogy with the finite difference version discussed in Section |8.3| 
As in that case, SBP can also be sometimes used to show semidiscrete stability when solving 
time-dependent partial differential equations (PDEs). The second one is an exact equivalence, 
for general Jacobi polynomials, between the resulting discrete expansions and interpolation at the 
Gauss points, a very useful property for collocation methods. Gauss quadratures and SBP are 
discussed in Section |9.4| followed by interpolation at Gauss points in Section |9.5| In Sections |9.6[ 
|9.7| and |9.8| we discuss spectral differentiation, the collocation method for time-dependent PDEs, 
and applications to numerical relativity. 

The results for orthogonal polynomials to be discussed are classical ones, but we present them 
because spectral methods are less widespread in the relativity community, at least compared to 
finite differences. The proofs and a detailed discussion of many other properties can be found in, 
for example, [191] and references therein. Ref. |232| is a modern approach to the use of spectral 
methods in time-dependent problems with the latest developments, while [55] discusses many issues 
which appear in applications, and [161] presents a very clear practical guide to spectral methods, in 
particular to the collocation approach. A good fraction of our presentation of this section follows 
|191j and [232) . to which we refer when we do not provide any other references, or for further 
material. 

9.1 Spectral convergence 

9.1.1 Periodic functions 

An intuition about expansion of smooth functions into orthogonal polynomials and spectral con- 
vergence can be obtained by first considering the periodic case in [0, 2tt] and expansion in Fourier 
modes, 

Pj(x) := ~ 1 =e i}x for integer j E Z. (300) 



/2tt 
These are orthonormal under the standard complex scalar product in £ 2 ([0, 27r]), 



(f,g):= I f(x)g(x)dx, /,<?eL 2 ([0,27r]), (301) 

(Pj,Pf) = S if ■ ( 302 ) 
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Furthermore, they form a complete basis under the norm induced by the above scalar product. 
More explicitly, the expansion of a continuous, periodic function in these modes, 



f(x) = E ^pj( x ) 



(303) 



j=-oo 



converges to / in the L 2 norm if 



E i/i 



< oo . 



The Fourier coefficients /,• can be computed from the orthonormality condition ( 302 1 of the 



basis elements defined in Equation ( 300 ) , 



fi = (PjJ) = 



f(x)e~ ijx dx . 



(304) 



The truncated expansion of / is (assuming N to be even) 

N/2 

v N [f\{x)= y, few- 

j = -N/2 



(305) 



where the notation is motivated by the fact that the Vn operator can also be seen as the orthogonal 
projection under the above scalar product to the space spanned by {pj : j = —N/2 . . . N/2} see 
also Section [9.2| . The error of the truncated expansion, using the orthonormality of the basis 
functions (Parseval's property) is 



ll/-7M/]l| 2 = E I/ 

\ 3 \>N/2 



3\ ) 



(306) 



from which it can be seen that a fast decay in the error relies on a fast decay of the high fre- 
quency Fourier coefficients |/j| as j -> oo. Using the explicit definition of the basis elements pj in 
Equation (300) and the scalar product in (301), 



l/il = l<P,-./>l 



f 



/2tt 



2tt 



f(x)e~' l]x dx 



Integrating by parts multiple times, 

2tt 
l/il 



'2ir\j\ 



f'{x)e- l3X dx 



Mi\ 6 



f [s \x)e- llx dx 



w- 



(f {s \p 3 ) 



< 



1 



(*) 



m\ 



i 



(*) 



and the process can be repeated for increasing s as long as the s-derivative f^ s ' remains bounded 
in the I? norm. In particular, if / G C°°, then the Fourier coefficients decay to zero 

\fj\ -> as j -> oo 
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faster than any power law, which is usually referred to as spectral convergence. The spec- 
tral denomination comes from the property that the decay rate of the error is dominated by the 
spectrum of an associated Sturm-Liouville problem, as discussed below. The convergence rate for 



each Fourier mode in the remainder can be extended to the whole sum ( 306 1 . More precisely, the 
following result can be shown (see, for example, |232j ): 



Theorem 16. For any f e 



H p s [0,27rl 



(p standing for periodic) there exists a constant C > 



independent of N such that for all N > 1 



\\f-v N [f}\\<CN- 



d s f 



dx s 



(307) 



In fact, an estimate for the difference between u and its projection similar to (307) but on the 
infinity norm can also be obtained |232j . 

In preparation for the discussion below for non-periodic functions, we rephrase and re-derive 
the previous results in the following way. Integrating by parts twice, the differential operator 

V = -dl 



is self-adjoint under the standard scalar product (|301|, 

(f,Vg) = (Vf,g) 

for periodic, twice continuously differcntiable functions / and g. Therefore the eigenfunctions pj 
of the problem 

Vpj{x) =\jPj(x) (308) 



are orthogonal (and can be chosen orthonormal) - they turn out to be the Fourier modes (300) - 
represent an orthonormal complete set for periodic functions in L 2 , the expansion (303) converges, 



and the error in the truncated expansion (305 1 is given by the decay of high order coefficients, see 



Equation (306). Assuming / is smooth enough, the fast decay of such modes is a consequence of 



V being self-adjoint, the basis elements pj being solutions to the problem (308), 



Pj = \r v P3 » 

and the eigenvalues satisfying Xj ~ j 2 for large j (in the Fourier case, Xj = j 2 holds exactly). 
Combining these properties, 



l/il = K/^)I = T^K/.^)I = ^|IW^)I = |^ 



{Wf, P3 ) 



i 



< 



i 



j)Wf 



(v (s) f, Pj 

(309) 



where V^ denotes the application of V s times (in this case, T>^ is equal to (— l) s 9| s ). 

The main property th at le ads to spectral convergence is then the fast decay of the Fourier 
coefficients, see Equation (309), provided the norm of D' s ) remains bounded for large s. 

Before moving to the non-periodic case we notice that in either the full or truncated expansions 



the integrals (304) need to be computed. Numerically approximating the latter leads to discrete 



expansions and an appropriate choice of quadratures for doing so leads to a powerful connection 
between the discrete expansion and interpolation. We discuss this in Section |9.4[ directly for the 
non-periodic case. 
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9.1.2 Singular Sturm Liouville problems 

Next, consider non-periodic domains (a, b) (which can actually be unbounded; for example, (0, oo) 
as in the case of Laguerre polynomials) in the real axis. We discuss how bases of orthogonal 
polynomials with spectral convergence properties arise as solutions to singular Sturm-Liouville 
problems. 

For this we need to consider more general scalar products. For a continuous, strictly positive 
weight function u> on the open interval (a, b) we define 

b 
(h,g) u = h(x)g(x)u(x)dx, (310) 



and its induced norm, ||/i|| w := y/(h, ti)^, on the Hilbert space L^j(a,b) of all real-valued, measur- 
able functions h,g on the interval (a, b) for which \\h\\ u and \\g\\ u are finite. 
Consider now the Sturm-Liouville problem 

Vpj{x) — uj(x)XjPj{x) (311) 

where I? is a second order linear differential operator on (a, b) along with appropriate boundary 
conditions so that it is self-adjoint under the non-weighted scalar product, 

(f,T>g) u =i = (Df,g) u =i, (312) 

for all twice continuously differentiable functions /, g on (a, b) which are subject to the boundary 
conditions. Then the set of eigenfunctions is also complete, and orthonormal under the weighted 
scalar product, and there is again a full and truncated expansions as in the Fourier case, 

oo 

/o) =^2fjPj(x), 

j=0 

N 

V N [f]{x) = Y J f ] P 3 {*)i 

3=0 

with coefficients 

fj = (Pj,f)w 

The truncation error is similarly given by 

ll/-7M/]ll* = E/J 

j>N 

and spectral convergence is again obtained if the coefficients fj decay to zero as j — > oo faster than 
any power law. Consider then, the singular Sturm-Liouville problem 

T>pj(x) — —d x (m(x)d x pj(x)) + n{x)pj(x) = uj{x)\jPj{x) (313) 

with the functions m, n : (a, b) — >■ M being continuous and bounded and such that n(x) > and 
m{x) > for all x € (a, b) and - thus the singular part of the problem -, 

m{a) = m{b) = 0. (314) 

For twice continuously differentiable functions with bounded derivatives, the boundary terms aris- 



ing from integration by parts of the expression (f,T>g) u =i cancel due to Equation (314), and it 
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follows that the operator T> is symmetric, see Equation (312 1. Therefore one can proceed as in the 
Fourier case and arrive to 



l/,-|< 



1 



Ail 4 



W 



/ 



with spectral convergence if / € C°° and, for example, 

\Xj\ ~ j 2 , for large enough j > j„ 



(315) 



Theorem 17. TTie solutions to the singular Sturm-Liouville problem JjJjp urai/i (a, 6) = (—1,1) 
and 



m(x) = (l-x) Q+1 (l + x) 1+ ^, 
u(x) = (l-x) a (l + x)P, 
n(x) = 0, 



(316) 

(317) 
(318) 



where a, /3 > ~1, are the Jacobi polynomials P- ( x )- Here, P ■ (x) /ias degree j, and it 
corresponds to the eigenvalue 

X j =j(j + a + P + l). 



Notice that the eigenvalues satisfy the asymptotic condition (315) and, roughly speaking, guar- 



antees spectral convergence. More precisely, the following holds (see, for example, [191] ) - in 
analogy with theorem 16 for the Fourier case - for the expansion Vn [/] of a function / in Jacobi 
polynomials: 

Theorem 18. For any f(x) G H^[0,2tt] (uj refers to the weight junction) there exists a constant 
C > independent of N such that 



\\f-V N [f]\U<CN- 



(i-x 2 y/ 2 



dx s 



V7V> s. 



(319) 



Sturm-Liouville problems are discussed in, for example, [419] . Below we discuss some properties 
of general orthogonal polynomials. 

9.2 Some properties of orthogonal polynomials 

Given any weighted scalar product (•, -) u as in Equation ( |310[ ), the expansion of a function onto 
polynomials is optimal in the associated norm if orthogonally projected to the space of polynomials 
Pat of degree at most N. More precisely, given a function / G L^{a, b) on some interval (a, b) and 
an orthonormal basis Pjy of polynomials {pj}f =0 , where pj has degree j, its orthogonal projection, 



A' 



J=0 



onto the subspace spanned by Pjv satisfies 

V N [f] = 



min \\f~f N \\l, 



or 



\\P N \f\-f\\ u <\\f N -f\\ u 
for all /at in the span of Pat, that is, it minimizes the error for such f^. 



(320) 

(321) 
(322) 
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The operator (320) is a projection in the sense that 



and it is orthogonal with respect to 



V 2 N =V N , 
: the residual r := / — VnU] satisfies 
(r,V N [f]) u = 0. 



Notice that unlike the interpolation problem (discussed below) here the solution of the above least 
squares problem ( 321[ ) is not required to agree with / at any prescribed set of points. 

In order to obtain an orthonormal basis of Pjy a Gram-Schmidt procedure could be applied 
to the standard basis {x°}™ = q. However, exploiting properties of polynomials, a more efficient 
approach can be used, where the first two polynomials po,pi are constructed and then a three- 
term recurrence formula is used. 

In the following construction each orthonormal polynomials is chosen to be monic, meaning 
that its leading coefficient is one. 

• The zero-th order polynomial: 

The conditions that po has degree zero and that it is monic only leaves the choice 

Po(a?) = l. (323) 

• The first order one: 

Writing pi(x) — x + b\ the condition (po,pi)oj = yields 



Pi(x) 



(1,1) 



(324) 



• The higher order polynomials: 

Theorem 19. (Three-term recurrence formula for orthogonal polynomials) 

For monic polynomials {pk}k=o which are orthogonal with respect to the scalar product 
where each p k is of degree k, the following relation holds 



fork = 1,2,3,. 



Pfc+i 

.,N-1. 



XPk 



{xpkiPk) 

(Pk,Pk)c 



~Pk 



(xp k ,Pk- 



lluj 



ipk-l,pk- 



-Pk-l 



(325) 



Proof. Let 1 < k < N — 1. Since xpk is a polynomial of degree k + 1, it can be expanded as 

fc+i 
xpk{x) = y2ajPj(x), 



j=o 



where the orthogonality of the polynomials {pk\k=a implies that aj = (xpk,Pj)ui/(Pj,Pj)u 
for j = 0, 1, 2, . . . , k + 1. However, since (xpk,Pj) u — (Pk,xPj) u an d xpj can be expanded 
in terms of the polynomials po,pi, . . . ,Pj+i, it follows again by the orthogonality of {pk}k=o 
that aj = for j ; < k — 2. Finally, ak+i = 1 since pk and Pk+i are both monic. This proves 
Equation ((3251). □ 



Notice that Pk+i as defined in Equation (3251 remains monic and can therefore be automat- 



ically used for constructing Pk+2 , without any rescaling. 



Equations ( 323[ 324 3251 allow to compute orthogonal polynomials for any weight function w, 
without the expense of a Gram-Schmidt procedure. For specific weight cases, there are even more 



explicit recurrence formulae, such as those in Equations (326 327) and (329 330 331 ) for Legendre 
and Chebyshev polynomials, respectively. 
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9.3 Legendre and Chebyshev polynomials 

For finite intervals, Legendre and Chebyshev polynomials are the ones most typically used. In the 
Chebyshev case, the polynomials themselves, their roots and quadrature points in general can be 
computed in closed form. They also satisfy a minmax property and lend themselves to using a 
Fast Fourier Transform (FFT). 

9.3.1 Legendre 

Legendre polynomials correspond to the trivial weight function w=l and the choice a — j3 = in 



Equations (316 317 318). The eigenvalues are 

Ai=i(i + i), 

and the first two polynomials 

Po(x) = l, P 1 {x)=x. (326) 

A variation (in that it leads to non-monic polynomials) of the three term recurrence formula (1325 ) 
leading to the normalization 



W)= ¥TT P P iW- 7TT P i-iW» 3 = 1,2,3,..., (327) 



(Pi,Pj) u = J Pi(x)Pj(x)dx = g-^TjA 
9.3.2 Chebyshev 



1/2 



Chebyshev polynomials correspond to the choice u(x) = (l — x 2 ) , x G (— 1, 1) and a = f3 = 



— 1/2 in Equations (316 317 318). In particular, the eigenvalues are 



A, = .f ■ 

A closed-form expression for Chebyshev polynomials (which lends itself to the use of FFT) is 

T 3 (x) = cos(jcos- 1 {x)) , j = 0,1,2,..., (328) 

At first sight it might appear confusing that, given the above definition through trigonometric 
functions, T n (x) are actually polynomials in x (of degree n, in fact). To get an idea of why this is 
so, we can compute the first few: 

T (x) = cos(O) 

= 1, 

T\{x) — cos(cos _1 (x)) 

= x, 

T 2 {x) = cos(2cos _1 (a;)) = -1 + 2cos 2 (cos _1 (x)) 

= -l + 2x 2 . 

Notice from the above expressions that T%(x) = 2xT\{x) — Tq. In fact, a slight variation of the 



three-term recurrence formula ( 325 1 for Chebyshev polynomials becomes 

T (x) = 1 (329) 

Ti(x) = x (330) 

T J+1 (x) = 2xT j (x)-T j _ l , j = 1,2,3,.... (331) 
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As in the Legendre case this is a variation of Equations (323 324 3251 in that the resulting 
polynomials are not monic: the leading coefficient is given by 

T j {x) = 2^- l x j + ..., 

That is, the polynomial 2 1 ~ : 'Tj(x) is monic. 

Both Equation (328) and Equation (33f ) lead to the normalization 



(T i ,T j ) u = jT i (x)T j (x)(l-x 2 ) 
-l 



-1/2 



dx = 5,; 



2 



for i = 
for i > 



From the expression (328), it can be noticed that the roots {xj} of Tjv +1 arc 

j =0,1,2,..., TV. 



( 2 j + 1 

COS — — -7T 



(332) 



\2N + 2 
These points play an important role below in Gauss quadratures and collocation methods (Sec- 



tion 



9.4) 



9.3.3 The minmax property of Chebyshev points 

Chebyshev polynomials satisfy a rather remarkable property in the context of interpolation. In 



Section 8.1 we pointed out that the error in polynomial interpolation of a function / at (N + 1) 
nodal points Xj satisfies [cf. Equation (277)] 



E N (x) = 



(N + iy. 



f (N+1 H^)LO N+1 (x) 



T N 



where u N+1 {x) := II^o^ ~ x i)- 

When doing global interpolation, that is, keeping the endpoints {xo,Xn} fixed and increasing 
N, it is not true that the error converges to zero even if the function is C°° . For example, for each 
N, f( N+1 '(x) could remain bounded as a function of x, 

\f {N+1) (x)\<C N Vx€[x ,x N }, 

but C n could grow with N. A classical example of non-convergence of polynomial interpolation on 
uniform grids is the following (see, for example, [150] ): 



Example 53. Runge phenomenon. 
Consider the function 



m = 



l + x 2 



x e [-5,5] 



and its interpolating polynomial Xjv[/](x) [cf. Equation (274)] at equally spaced points 



-5+ -10, 



i = 0, 



,N. 



Then 

\f(x)-l N [f}(x)\^oo 

where x c w 3.63. 



as N — > oo \/x such that x c < \x\ < 5 . 



The error ( 277 ) can be decomposed into two terms, one related to the behavior of the derivatives 



°f /; f^ N+1 H^x)/{N + 1)!, and the another one related to the distribution of the nodal points, 
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<^N+i( x )- We assume in what follows that x € [aJoi^iv] and that [x ,xn] = [— 1, 1]. The analogue 
results for an arbitrary interval can easily be obtained by a shifting and rescaling of coordinates. 
It can then be shown that for all choices of nodal points 

max \u N+1 (x)\ > 2- N . (333) 

ase[-l,l] 

Furthermore, the nodes which minimize this maximum (thus the minmax term) are the roots of 
the Chebyshev polynomial of order (n + 1), for which the equality is achieved: 



max Iwjv+i(a0| — 2 

xe[-i,i] 



-N 



when {xo,Xi, . . . ,Xn} are given by Equation (332); see, for example, |135j . for the proof. 



In other words, using Chebyshev points, that is, the roots of the Chebyshev polynomials, as 
interpolating nodes minimizes the maximum error associated with the nodal polynomial term. 
Notice that in this case, the nodal polynomial is given by Tn+i(x)/2 n . 

9.4 Gauss quadratures and Summation By Parts 

When computing a discrete expansion in terms of orthogonal polynomials 

N 

In{x) =^2fjPj{x) 
one question is how to efficiently numerically approximate the coefficients {fj} as given by Equa- 



tion ( 320 ) . This involves computing weighted integrals of the form 

b 
q(x)uj(x)dx. (334) 



If approximating the weighted integral ( 334 ) by a quadrature rule, 

b . N 

I q(x)uj(x)dx « ^^ ^9( x i): (335) 

where the points {x{\ are given but having the freedom to choose the coefficients {^4j}, by a count- 



ing argument one would expect to be able to choose the latter in such a way that Equation ( 335 1 is 



exact for all polynomials of degree at most N. That is indeed the case, and the answer is obtained 



by approximating q(x) by its polynomial interpolant (274) and integrating the latter 



q(x)w(x)dx fa I y^q(xi)£\ (x)oj{x)dx = \^A\ q(xi) 
a '=° >=° 

oN\N 



where the {if }f =0 are the Lagrange polynomials (276) and the coefficients 



6 
A\ N) = f i ( f\x)u{x)dx J = 0,l,...,iV, (336) 
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are independent of the integrand q{x). If the weight function is non-trivial they might not be 
known in closed form, but since they are independent of the function being integrated they need 
to be computed only once for each set of nodal points {xi}. 

Suppose now that, in addition to having the freedom to choose the coefficients {Ai} we can 
choose the nodal points {xi}. Then we have (iV+1) points and (N + l) {Ai}, i.e., (2N + 2) degrees 
of freedom. We therefore expect that we can make the quadrature exact for all polynomials of 
degree at most (2N + 1). This is indeed true and is referred to as Gauss quadratures. Furthermore, 
the optimal choice of Ai remains the same as in Equation ( 336 1 , and only the nodal points need 
to be adjusted. 

Theorem 20 (Gauss quadratures). Let to be a weight function on the interval (a,b), as introduced 
in Equation (310), and let be p n +\ be the associated orthogonal polynomial of degree N-j-1. Then, 
the quadrature rule (335) with the choice (336) for the discrete weights, and as nodal points {xj} 
the roots of pn+i is exact for all polynomials of degree at most (2N + 1). 

The following remarks are in order: 

• The roots of p n Ni{x) are referred to as Gauss points or nodes. 

• Suppose that u>(x) = (1 — x 2 )~ 1 / 2 . Then the (N + 1) Gauss points, i.e., the roots of the 
Chebyshev polynomial T/v+i(x) [see Equation (338)], are exactly the points that minimize 



the infinity norm of the nodal polynomial in the interpolation problem, as discussed in 
Section ESM 

One can see that the Gauss points actually lie inside the interval (a, b), and do not contain the 
endpoints a or b. Now suppose that for some reason we want the nodes to include the end points 
of integration, 

xq = a, xn = b. 

One reason for including the end points of the interval in the set of nodes is when applying boundary 
conditions in the collocation approach, as discussed in Section [T0| Then we are left with two less 
degrees of freedom compared to Gauss quadratures and therefore expect to be able to make the 
quadrature exact for polynomials of order up to (27V + 1) — 2 = (2N — 1). This leads to: 

Theorem 21 (Gauss-Lobatto quadratures). If we choose the discrete weights according to Equa- 
tion (336) as before but as nodal points the so called Gauss-Lobatto ones, i.e., the roots of the 



polynomial 



m N+1 (x) = p N +i(x) + ap N (x) +/3pAr_i(x), 



(337) 



with a and (3 chosen so that r?ijv+i(a) = = m n+ i(b), then the quadrature rule (335) is exact for 
all polynomials of degree at most (2N — 1). 

Note that the coefficients a and /? in the previous equations are obtained by solving the simple 
system 

m N+1 (a) = =p N+ i(a)+ap N (a)+l3p N -i{a), 

m N+ i(b)= =p N+ i{b) +ap N (b) + [3p N -i(b). 

One can similarly enforce that only one of the end points coincides with a quadrature one, 



leading to Gauss-Radau quadratures. The proofs of theorems 20 and 21 can be found in most 
numerical analysis books, in particular |237j . 

For Chebyshev polynomials there are closed form expressions for the nodes and weights in 



Equations (335 and 3361 
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Chebyshev Gauss quadratures. For j = 0, 1, . . . , N, 

Xj = — cos — - — -7r (338) 

4*" = *TT (330) 

Chebyshev Gauss Lobatto quadratures. 

s i = -cosf^V forj = 0,1,..., iV, (340) 



iV 



aw 



7T 
JV 

7T 
27V 



forj = 1,2,..., (TV -1) 
for j = and N 



Summation By Parts. For any two polynomials p(x) 1 q(x) of degree N, in the Legendre case 
Summation By Parts follows for Gauss, Gauss-Lobatto or Gauss-Radau quadratures, in analogy 
with the finite difference case (Section |8.3[ ). 

Since both products q{x)dp{x) / dx and p(x)dq{x) / dx are polynomials of degree (2n — 1), their 
quadratures are exact (in fact, the equality holds for each term separately): 

d \ I d \ Id 



d P,q) +\P>te<I/ = \^'V + \ P >dx~ q 
ax j \ ux j \ux j \ ax 



where we have introduced the discrete counterpart of the weighted scalar product (310) 

JV 



( h >9)1 ■^^A l h(x i )g(x, i ), 



i=0 

with the nodes {x{\ and discrete weights {A{\ those of the corresponding quadrature. 
On the other hand, in the Legendre case, 

-faPil) + \ P, dx~ q ) = P(%( & ) - K a M«) > 

and therefore 

(Am) +(p^i) =P(b)q(b)-p(a)q(a). (341) 



Property (341 1 will be used in Section 9.7 when discussing stability through the energy method, 



much as in the finite difference case. 

9.5 Discrete expansions and interpolation 

Suppose one approximates a function / through its discrete truncated expansion, 

N 

f(x)*Klf](x) = J2f}P3< X )- ( 342 ) 

3=0 
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That is, instead of considering the exact projection coefficients {fj}, these are approximated by 
discretizing the corresponding integrals using Gauss, Gauss-Lobatto or Gauss-Radau quadratures, 

b . N 

fj = (Pj»/>w = / f{x)p {x)u{x)dx « ff := ^2f(xi)pj(Xi)Ai 
{ »=o 



with Ai given by Equation (336) and {xi} any of the Gauss-type points. Putting the pieces 
together, 

N / N 

i=0 \ 3=0 

Since if / is a polynomial of degree smaller than or equal to N, the discrete truncated expansion 
V%[f] = / is exact for Gauss or Gauss-Radau quadratures according to the results in Section 9.4 it 
follows that the above term inside the square parenthesis is exactly the i-th Lagrange interpolating 
polynomial, 

N 

Ai y £ i p j {x i )p j (x)=e?(x). 

j=o 

Therefore, we arrive at the remarkable result: 

Theorem 22. Let ui be a weight function on the interval (a,b), as introduced in Equation j3T(a ), 
and denote by V^lf] the discrete truncated expansion of f corresponding to Gauss, Gauss-Lobatto 
or Gauss-Radau quadratures. Then, 

N 

n[/](z)=nf](z)=£/(^f(*)- (343) 

i=0 

That is, the discrete truncated expansion in orthogonal polynomials of f is exactly equivalent to 
the interpolation of f at the Gauss, Gauss-Lobatto or Gauss-Radau points. 

The above simple proof did not assume any special properties of the polynomial basis, but does 
not hold for the Gauss-Lobatto case (for which the associated quadrature is exact for polynomials 
of degree 2N—1). The result, however still holds (at least for Jacobi polynomials), see, for example, 



Examples of Gauss- type nodal points {x^ are those given in Equation (338) or Equation (340) 



As we will see below, the identity ( |343| ) is very useful for spectral differentiation and collocation 
methods, among other things, since one can equivalently operate with the intcrpolant, which only 
requires knowledge of the function at the nodes. 

9.6 Spectral collocation differentiation 



The equivalence ((343) between the discrete truncated expansion and interpolation at Gauss-type 



points allows the approximation of the derivative of a function in a very simple way, 
ri d d N d 



Therefore, knowing the values of the function / at the collocation points, i.e., the Gauss- type 
points, we can construct its interpolant V%, take an exact derivative thereof, and evaluate the 
result at the collocation points to obtain the values of the discrete derivative of / at these points. 
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This leads to a matrix-vector multiplication, where the corresponding matrix elements Dij can be 
computed once and for all: 



dx 



f{ Xi ) « Dijfixi) i = 0,l,...,N, 



with 



ij ' _ dx ' 



We give the explicit expressions for this differentiation matrix for Chebyshev polynomials both at 
Gauss and Gauss-Lobatto points (see, for example, |161[I232] ). 



Chebyshev Gauss. 



D, 



2(1 -a?) 



(Xi Xj)l N+1 [Xj) 

with a prime denoting differentiation. 
Chebyshev Gauss Lobatto. 

2iV 2 + l 



for i = j, 
for i ^ j, 



D, 





6 


Ci (- 


_iy+j 


c 3 x, 


— Xj 




Xi 


20 


-4) 


2N 2 


+ i 



G 



for i = j = 0, 

for i ^ j, 

for i = j = 1, . 

for i = j = N, 



(N-l), 



where c, = 1 for z = 1, . . . (N — 1) and Cj = 2 for i = 0, N. 

9.7 The collocation approach 

When solving a quasilinear evolution equation 

Ut = P(t, x, u, d/dx)u + F(t, x, u) 



(344) 



using spectral expansions in space and some time evolution scheme, one could proceed in the 
following way: Work with the truncated expansion of u(t, x), 



N 



V N [u](t,x) = y^bi(t)pi{3 



i=0 



and write the PDE (344) as a system of (n + 1) coupled evolution ordinary differential equations 
for the bi(i) coefficients, subject to the initial condition 

bi(t = 0) = (u(t = 0,-), P i) u , 
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where the quadratures can be approximated using, say, Gauss-type points. One problem with this 
approach is that if the equation is nonlinear or already at the linear level with variable coefficients, 



the right hand side of Equation ( 344 1 needs to be re-expressed in terms of truncated expansions at 
each timestep. Besides the complexity of doing so, accuracy is lost because higher order modes due 
to nonlinearities or coupling with variable coefficients are not represented. Or, worse, inaccuracies 
from those absent modes move to lower frequency ones. This is one of the reasons why collocation 
methods are instead usually preferred for this class of problems. 

In the collocation approach the differential equation is exactly solved, in physical space, at the 



collocation points {xi}™ =0 , which are those appearing in Gauss quadratures (Section 9.4 1. Assume 



Ut(t,x) = ^2A J (t,x,u)-—u + F(t,x,u), 



for defmiteness that we are dealing with a symmetric hyperbolic system in three dimensions, 

d 

A-*(t,x,u)- ( 

j=l 
see Section |3.3| We approximate u by its discrete truncated expansion 

u N :=V%[u]. (345) 

Then the system is solved at the collocation points, 

d 3 q 

—u N (t,Xi) = J~] A 3 (t, Xj, u N ) -^-jU N (t, Xj) + F(t,Xi,u N ), (346) 

here the spatial derivatives are approximated using spectral differentiation as described in Sec- 



\v 



tion 



9.6 



The system can then be evolved in time using the preferred time integration method, see 
SectionTO 

From an implementation perspective, there is actually very little difference between a spectral 
collocation method and a finite difference one: the only two being that the grid points need to be 
Gauss-type ones and that the derivative is computed using global interpolation at those points. In 
fact, the actual projection (|345) never needs to be computed for actually solving the system (346 1: 



given initial data u(t = 0, x), by construction the interpolant coincides with it at the nodal points, 

u N (t = 0,Xi) — u N (t — 0,Xi) , (347) 



(347) 



and the system (3461 for u n (t,Xi) is directly numerically evolved subject to the initial condition 



Stability. As discussed in Section |SL4| in the Legendre case the discrete truncated expansion using 
Gauss-type quadratures leads to Summation by Parts (SBP). In analogy with the finite difference 



case (Section 8.3), when the continuum system can be shown to be well posed through the energy 
method, a semi-discrete energy estimate can be shown by using the SBP property, at least for 
constant coefficient systems, and modulo boundary conditions (discussed in the following section). 
Consider the same case discussed in Section [9~4"l a constant coefficient symmetric hyperbolic system 
in one dimension, 

u t = Au x , 



and a collocation approach 



d d 

—u N (t, Xi) = A—u N (t, Xi), (348) 



at Legendre-Gauss-type nodes. Then defining 

E d = (u N ,u N )i =1 (349) 
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taking a time derivative, as in, Section |9. 4 



dE d 

-37- = \UN,Un) uz 



{U N ,U N ) 



dx 



Aun,un 



Au N , —u r 
ox 



(350) 



j=i 



Now, u^jduM /d x is a polynomial of degree (2N — 1) so the above discrete scalar product is exact 
for Gauss, Gauss-Lobatto and Gauss-Radau collocation points [cf. Equation (341 1], and if the 
system (such as symmetric hyperbolic ones) admits an energy estimate at the continuum under 
the L 2 norm, then semidiscrete numerical stability follows (modulo boundary conditions). 

The weighted norm case ui =^ 1 is more involved. In fact, already the advection equation is not 
well posed under the Chebyshev norm, see for example |232j . 



Spectral viscosity. In analogy with numerical dissipation ( Section |8. 5 [ ), spectral viscosity (SV) 
adds a resolution dependent dissipation term to the evolution equations without sacrificing spectral 
convergence. SV was introduced by Tadmor in |396j . For simplicity consider the Fourier case. Then 
spectral viscosity involves adding to the evolution equations a dissipative term of the form 



d i \ i t\s da 

- UN = (...)- etf (-iy— 



i(t,x) 



8 s Un 

dx s 



s > 1, 



where s is the (fixed) order of viscosity, the viscosity amplitude scales as 



£N 



N2s-1 ' 



c„ > o, 



and the smoothing functions Q m effectively apply the viscosity to only th e upp er portion of the 
spectrum. In more detail, if Qj(t) are the Fourier coefficients [cf. Equation (300)] of Q m (t,x) then 
they are only applied to frequencies j > ttim in such a way that they satisfy 

(2s-l)/0 



1 



\\J\ 



< 



,•(*) < 1 



with 



m N - N e 



< 



2s 



2.s 



The case s = 1 corresponds to a dissipation term involving a second derivative and s > 1 is referred 
to as super (or hyper) viscosity. Higher values of s (up tos~ vN) dissipate 'less aggressively'. 

The Legendre and Chebyshev cases are similar and are discussed in [2831 I284J . The web- 
page [394J keeps a selected list of publications on spectral viscosity. 



9.8 Going further, applications in numerical relativity 

Based on the minimum gridspacing between spectral collocation points, one would naively expect 
the CFL limit to scale as 1/N 2 , where N is the number of points. The expectation indeed holds, 
but the reason is related to the O (N 2 ) scaling of the eigenvalues of Jacobi polynomials as solutions 
to Sturm-Liouville problems (in fact, the result holds for non-collocation spectral methods as well) 
[206]. 

There are relatively few rigorous results on convergence and stability of Chebyshev collocation 
methods for initial-boundary value problems; some of them are [203 and [205J. 

Even though this review is concerned with time-dependent problems, we note in passing that 
there is a significant number of efforts in relativity using spectral methods for the constraint 
equations. The use of spectral methods in relativistic evolutions can be traced back to pioneering 
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work in the mid eighties [53] (see also [SSJ |M1 1207] ). Over the last decade they have gained 
popularity, with applications in scenarios as diverse as relativistic hydrodynamics [3051 14151 1416] , 
characteristic evolutions [42] . absorbing and/or constraint-preserving boundary conditions [306, 
357, 353, 351], constraint projection |239j . late time "tail" behavior of black hole perturbations 
|370|, l408 . cosmological studies ^7\ H7J 08] , extreme mass ratio inspirals within perturbation theory 
and self-forces [1091 [T57, 108, 413, 111| ,I110|I120| and, prominently, binary black hole simulations 
(see, for example, [57H [MB |M1 H>H1 H2S HO HO Q2E1 [ED HH2] ) and black hole-neutron star ones 



|147U163j . The method of lines (Section 7.3) is typically used with a small enough timestep so that 
the time integration error is smaller than the one due to the spatial approximation and spectral 
convergence is observed. Spectral collocation methods were first used in spherically symmetric 
black hole evolutions of the Einstein equations in [249] and in three dimensions in [248] . The latter 
work showed that some constraint violations in the Einstein-Christoffel [20] type of formulations 
do not go away with resolution but are a feature of the continuum evolution equations (though the 
point -namely, that time instabilities are in some cases not a product of lack of resolution, applies 
to many other scenarios). 

Most of these references use explicit symmetric hyperbolic first order formulations. More re- 
cently, progress has been made towards using spectral methods for the BSSN formulation of the 
Einstein equations directly in second order form in space [40711158] . and, generally, on multi-domain 
interpatch boundary conditions for second order systems [401J (numerical boundary conditions are 
discussed in the next section). A spectral spacetime approach (as opposed to spectral approxima- 
tion in space and marching in time) for the 1+1 wave equation in compactified Minkowski was 
proposed in [228] ; in higher dimensions and dynamical spacetimes the cost of such approach might 
be prohibitive though. 

Reference 81J presents an implementation of the harmonic formulation of the Einstein equations 
on a spherical domain using a double Fourier expansion and, in addition, significant speed-ups using 
Graphics Processing Units (GPUs). 

Ref. [209 presents a detailed review of spectral methods in numerical relativity. 

A promising approach which, until recently, has been largely unexplored within numerical 
relativity is the use of discontinuous Galerkin methods (233. J 4441 11571 [T551 328 . 
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10 Numerical Boundary Conditions 

In practical computations one usually inevitably deals with an initial-boundary value problem and 



numerical boundary conditions in general have to be imposed (but see the discussion in Section 6.4 1. 
In this section we discuss some approaches for doing so, with emphasis on sufficient conditions for 
stability based on the energy method, sinrplicity, and applicability to high order and spectral 
methods. In addition to outer boundaries, we also discuss interface ones appearing when there are 
multiple grids. 

General stability results through the energy method are available for symmetric hyperbolic first 
order linear systems with maximal dissipative boundary conditions. Unfortunately, in many cases 
of physical interest the boundary conditions are often neither in maximal dissipative form nor is 
the system linear. In particular, this is true for Einstein's field equations, which are nonlinear, 
and, as we have seen in Section [61 require constraint-preserving absorbing boundary conditions 
which do not always result in algebraic conditions on the fields at the boundary. Therefore, in 
many cases one does "the best that one can" , implementing the outer boundary conditions using 
discretizations which are known to be stable at least in the linearized, maximal dissipative case. 
Fortunately, since the outer boundaries are usually placed in the weak field, wave zone, more often 
than not this approach works well in practice. At the same time, it should be noted that the initial- 
boundary value problems (IBVPs) for General Relativity formulated in [181] and [257] (discussed 
in Section [5]) are actually based on a symmetric hyperbolic first order reduction of Einstein's field 
equations with maximal dissipative boundary conditions (including constraint-preserving ones). 
Therefore, it should be possible to construct numerical schemes which can be provably stable, 
at least in the linearized regime, using the techniques described in the last two sections and in 



Section 10.1 A numerical implementation of the formulations of Refs. |181j and [257] has not yet 
been pursued. 

The situation at interface boundaries between grids, which are at least partially contained in the 
strong field region, is more subtle. Fortunately, only the characteristic structure of the equations 
is in principle needed at such boundaries, and not constraint-preserving boundary conditions. 
Methods for dealing with interfaces are discussed in Section |10.2| 

Finally, in Section |10.3| we give an overview of some applications to numerical relativity of 
the boundary treatments discussed in Sections |10.1| and |10.2| As mentioned above, most of the 
techniques that we discuss have been mainly developed for first order symmetric hyperbolic systems 



with maximal dissipative boundary conditions. In Section 10.3 we also point out ongoing and 
prospective work for second order systems, as well as the important topic of absorbing boundary 
conditions in General Relativity. 

Most of the methods reviewed below involve decomposition of the principal part, its time 
derivative, or both, into characteristic variables, imposing the boundary conditions and changing 
back to the original variables. This can be done a priori, analytically, and the actual online 
numerical computational cost of these operations is negligible. 

10.1 Outer boundary conditions 

10.1.1 Injection 

Injecting boundary conditions is presumably the simplest way to numerically impose them. It 
implies simply overwriting, at every or some number of timesteps, the numerical solution for each 
incoming characteristic variable or its time derivative with the conditions that they should satisfy. 
Stability of the injection approach can be analyzed through GKS theory [223J, since energy 
estimates are in general not available for it (the reason for this should become more clear below 
when discussing the projection and penalty methods). Stability analyses in general not only depend 
on the spatial approximation (and time integration in the fully discrete case) but are in general 
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also equation-dependent. Stability is therefore in general difficult to establish, especially for high 
order schemes and non-trivial systems. For this reason a semidiscrete eigenvalue analysis is many 
times performed. Even though this only provides necessary conditions (namely, the von Neumann 



condition (25f l) for stability, it serves as a rule of thumb and to discard obviously unstable schemes. 



As an example, we discuss the advection equation with "freezing" boundary condition, 



u t = u x , 

u{0,x) = f(x), 

u(t,l)=9(t), 



see [-1,1], 

see [-M], 
t> o, 



t > o, 



(351) 
(352) 
(353) 



where the boundary data is chosen to be constant, g(t) = /(f). As space approximation we use a 
Chebyshev collocation method at Gauss-Lobatto nodes. The approximation then takes the form 



dt 



u N (t,Xi) = (Qu N ) (t,xi), 



i = 0,l,...,N. 



where Q is the corresponding Chebyshev differentiation matrix. The boundary condition is then 
imposed by replacing the N — th equation by 



± UN (t, Xn ) = lg(t) 



0. 



For this problem and approximation an energy estimate can be derived |204| . but we present an 
eigenvalue analysis as a typical example of those done for more complicated systems and injection 
boundary conditions. Figure HI shows the semi-discrete spectrum using different number of collo- 
cation points N. As needed by the strong version of the von Neumann condition, no positive real 
component is present [cf. Equation (252)]. We also note that, as discussed at the beginning of 



Section 9.8 the spectral radius scales as ~ TV 
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Figure 4: Semidiscrete spectrum for the advection equation and freezing boundary conditions 



[Eqs. (35f 352 353 , a Chebyshev collocation method, and numerical injection of the boundary 
condition. The number of collocation points is (from left to right) N — 20, 40, 60. The scheme 
passes the von Neumann condition (no positive real component in the spectrum). In fact, as 
discussed in the body of the text, the method can be shown to actually be numerically stable. 



There are other difficulties with the injection method, besides the fact that stability results 
are usually partial and incomplete for realistic systems and/or high order or spectral methods. 
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One of them is that it sometimes happens that a full GKS analysis can actually be carried out 
for a simple problem and scheme, and the result turns out to be stable but not time-stable (see 



Section 7.4 for a discussion on time-stability) . Or the scheme is not time-stable when applied to a 
more complicated system (see, for example, [1131 12]). 

Seeking stable numerical boundary conditions for realistic systems which preserve the accuracy 
of high order finite difference and spectral methods has been a recurring theme in numerical 
methods for time-dependent partial differential equations for a long time, especially for non-trivial 
domains, with substantial progress over the last decade - especially with the penalty method 
discussed below. Before doing so we review another method, which improves on the injection one 
in that stability can be shown for rather general systems and arbitrary high order finite difference 
schemes. 

10.1.2 Projections 



Assume that a given IBVP is well posed and admits an energy estimate, as discussed in Section 5.2 
Furthermore, assume that, up to the control of boundary terms, a semi-discrete approximation to 
it also admits an energy estimate. The key idea of the projection method [3091 13111 1310] is to 
impose the boundary conditions by projecting at each time the numerical solution to the space of 
gridfunctions satisfying those conditions, the central aspect being that the projection is chosen to 
be orthogonal with respect to the scalar product under which a semi-discrete energy estimate in 
the absence of boundaries can be shown. The orthogonality of the projection then guarantees that 
the estimate including the control of the boundary term holds. 

In more detail, if the spatial approximation prior to the imposition of the boundary conditions 
is written as 

u t = Qu (354) 

and [for example, as is many times the case when Summation By Parts (SBP) holds] there is a 
semi-discrete energy, 

E d = (u, u) , 

with respect to some scalar product, for which an estimate holds up to boundary terms: 

-r,E d = (Qu, u) + (u, Qu) , 
at 

with 

(Qu, u) + (uQu) < F(t) + boundary terms 

for some F(i) independent of the initial data and resolution. Due to the SBP property, the 
boundary terms are exactly those present in the continuum energy estimate, except that so far 
they cannot be bounded because the numerical solution is not yet required to satisfy the boundary 
conditions. 



The latter are then imposed by changing the semidiscrete equation ( 354 1 to 

u t — VQu 

where V projects u to the space of gridfunctions satisfying the desired boundary conditions, is 
time-independent (which in particular requires the assumption that the boundary conditions are 
time-independent) and symmetric 

(u,Vv) = (Vu,v) 

(and, being a projection, V 2 = V). Since the projection is assumed to be time-independent, 
projecting both sides of Equation ( |354| ) implies that 

(Vu) t = VQu 
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and for any solution of (354) satisfying the boundary conditions, Vu = u, 

u t = VQu . 
Then 

— E d = (VQu, u) + (u, VQu) = (Qu, Vu) + (Vu, Qu) = (Qu, u) + (u, Qu) 
at 

< F(t) + boundary terms . 

Since the solution now satisfies the boundary conditions, the boundary terms can be bounded as 
in the continuum and stability of the semi-discrete IBVP follows. In principle, this method only 
guarantees stability for time- independent, homogeneous boundary conditions. Homogeneity can 
be assumed by a redefinition of the variables, but time-independent boundary conditions is a more 
severe restriction. 

Details on how to explicitly construct the projection can be found in [221] . The orthogonal 
projection method guarantees stability for a large class of problems admitting a continuum energy 
estimate. However, its implementation is somewhat involved. 

10.1.3 Penalty conditions 

A simple and robust method for imposing numerical boundary conditions, either at outer or in- 
terpatch boundaries, such as those appearing in domain decomposition approaches, is through 
penalty terms. The boundary conditions arc not imposed strongly but weakly, preserving the 
convergence order of the spatial approximation and leading to numerical stability for a large class 
of problems. It can be applied both to the case of finite differences and spectral approximations. 
In fact, the spirit of the method can be traced back to Finite Elements-discontinuous Galerkin 
methods (see [144] and [26] for more recent results). Terms are added to the evolution equations 
at the boundaries to consistently penalize the mismatch between the numerical solution and the 
boundary conditions the exact solution is subject to. 

Finite differences. In the finite difference context the method is known as the Simultaneous 
Approximation Technique (SAT) [114] . For semidiscrete approximations of initial-boundary value 
problems of arbitrary high order with an energy estimate, both the order of accuracy and the 
presence of energy estimates are preserved when imposing the boundary conditions through it. 
As an example, consider the half-space IBVP for the advection equation, 

t > 0, (355) 

(356) 
(357) 

where A > is the characteristic speed in the —x direction. 

We first consider a semi-discrete approximation using some finite difference operator D satis- 
fying SBP with respect to a scalar product S, which we assume to be either diagonal or restricted 



u t = Xu x , 


x<0 


u(0,x) = f(x), 


x<0 


u(t,0) = g(t), 


t> 0, 



full (see Section 8.3). As usual, Ax denotes the spacing between gridpoints, 

Xi = iAx, i = 0, —1, —2, .... 

In the SAT approach the boundary conditions are imposed through a penalty term which is applied 
at the boundary point xq, 

—Ui = \Dui^ ^— (g-u ), (358) 

at (Too^^ 
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where 5 i0 is the Kronecker delta, S a real parameter to be restricted below, and Ooo is the 00- 
component of the SBP scalar product S. Introducing the semi-discrete SBP energy 

E = (u,u)-£ , 

its time derivative is 

— E = (A - 2S)u 2 + 2gu S . (359) 

Next, define S through S = A + S. For homogeneous boundary conditions (g = 0) both numerical 
and time-stability follow if S > —A/2: 

— E = -{\ + 28)ul <0. 

Strong stability follows if 5 > —A/2, including the case of inhomogeneous boundary conditions: for 
any e such that 

0<e 2 <25- A, 



Equation (359) implies 



2 / n\ 2 



" 7n /\ na\..2 , n/„. \ [ 9&\ ^ f\ na\„.2 , /„2„.2\ , ( 9& \ -.2 , ( ^ \ ..: 



^-B = (A - 2^)^ + 2( £Wo ) 1^-1 < (A - 2SX + (e «o) + ^ J = -««5 + [- ) <f , (360) 

where 

e := 2S - A - e 2 > . 



Integrating Equation (360) in time, 



\\u(t)\\l+ej\ (r) 2 dT< (£) J\(r) 2 dr, 

thereby proving strong stability. 

In the case of diagonal SBP norms it is straightforward to derive similar energy estimates for 
general linear symmetric hyperbolic systems of equations in several dimensions, simply by working 
with each characteristic variable at a time, at each boundary. A penalty term is applied to the 



evolution term of each incoming characteristic variable at a time as in Equation (358), where A is 
replaced by the corresponding characteristic speed. In particular, edges and corners are dealt by 
simply imposing the boundary conditions with respect to the normal to each boundary, and an 
energy estimate follows. 

One can show what the global semidiscrete convergence rate is as follows. Define the error 
gridfunction as the difference between the numerical solution u and the exact one vS e > evaluated 
at the gridpoints, 

e i {t) = e(t,x i ):=u{t,x i )-u (e) (t,x i ), i = 0, -1, -2, . . . , (361) 

It satisfies the equation 

— e,i = XDei H — e - Ft , 

at cr o Aa; 

where Fi denotes the truncation error, here solely depending on the differentiation approximation: 

Fi(t)=F(t,Xi) = \(u^{t,Xi)-Du^(t,Xi)) . (362) 

0(Ax) r at and close to boundaries , /qrq\ 

0(Ax) 2p in the interior , ^ ' 
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with r < 2p in general. For example, in the diagonal case r 
r = 2p — 1 [see the discussion below Equation (289 (]. 



p, and in the restricted full one 



Using Equation (361) and the SBP property the norm of the error satisfies 
d „ 



eft 



= (A-25)e^-2(e,F) s <2||e|| s ||F|| 



(364) 



where we have used S < A/2 and the Cauchy-Schwarz inequality in the second step. Dividing both 
sides of the inequality by 2||e||s and integrating we obtain 



t 
|e(t)|| s < J \\F(T)\\vdT, 

b 



t> 0. 



Since by Equation (363) the norm of the truncation error is of the order (r + 1/2), it follows that 
the S-norm of the error converges to zero with rate (r + 1/2). This also implies that the error 
converges pointwise to zero with rate, 



e l (t) = 0((AxY 



t > 0, 



0,-1,-2, 



In particular, this proves that the error at the boundary point xq = converges to zero, and 



therefore, the penalty term in Equation (358) is finite. Two remarks are in order 



• In special cases it is possible to improve the error estimate exploiting the strong stability of the 
problem. Consider for instance the case of the -D2-1 operator defined in Example 51 with the 
associated diagonal scalar product (<?ij) = diag(l/2, 1, 1, 1, ...). Then, Equation (364) gives 



dt M 



(A - 2S)el - 2(e, F) x = (A - 2S)e 2 a - Axe F - 2Ax ^ e^ 



< 



< 



-2 1 



(A - 2S) + - 

e\\l + 0((&x 



Ax 2 2 
2e* ° 



Ax 



-1 

i— — 00 



Ft 



where we have chosen < e 2 /2 < 25 — A. This implies that the error converges to zero in 
the S-norm with second order and pointwise with order 3/2. 

• At any fixed resolution, the error typically^] decreases with larger values of the penalty 
parameter 5, but the spectral radius of the discretization grows quickly in the process by 
effectively introducing dissipative eigenvalues (on the left half plane) in the spectrum, leading 
to demanding CFL limits (see, for example J273 ). Because the method is usually applied 
along with high order methods, decreasing the error for a fixed resolution at the expense 
of increasing the CFL limit does not seem worthwhile. In practice values of 5 in the range 
—A/2 < 5 < give reasonable CFL limits. 



Spectral methods. The penalty method for imposing boundary conditions was actually intro- 
duced for spectral methods prior to finite differences in Refs. |192l 1193] . In fact, as we will see 
below, the finite difference and spectral cases follow very similar paths. Here we only discuss its 
application to the collocation method. Furthermore, as discussed in Section |9.4[ when solving an 
initial-boundary value problem, Gauss-Lobatto collocation points are natural among Gauss-type 



30 There are cases in which this is not true, at least for not too large penalty strengths. 
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nodes, because they include the end points of the interval. We restrict our review to them, but the 
penalty method applies equally well to the other nodes. We refer to [231] for a thorough analysis 
of spectral penalty methods. 

Like the finite difference case we summarize the method through the example of the advection 



problem (355 355 356 ), except that now we consider the bounded domain x £ [—1,1] and apply the 
boundary condition at x = 1. Furthermore, we first consider a truncated expansion in Legendre 
polynomials. A penalty term with strength r is added to the evolution equation at the last 
collocation point: 

—u N (t,x l ) = \u / N (t,x i )+T\(g(t)-u N (t,x n ))8. lN i = 0,l,...,N, (365) 

where {xi}^L Q are now the Gauss-Lobatto nodes; in particular, xq = —1 and Xn = 1. 



Using the discrete energy given by Equation ( 349 1 and Summation by Parts property associated 



with Gauss quadratures discussed in Section |9.4| a discrete energy estimate follows exactly as in 
the finite difference case if 

where ^4^ is the Legendre-Gauss-Lobatto quadrature weight at the right end point xm — 1 [cf. 



Equations (335 336)]. Notice that this scaling of the penalty with the weight is the analog of 



the (cooA^) SBP weight term for the finite difference case [Equation 358 . The similitude is not 



accidental: both weights arrive from the discrete integration formulae for the energy. Notice: 

• The approach for linear symmetric hyperbolic systems is the same one that we discussed for 
finite differences: the evolution equation for each characteristic variable is penalized as in 
Equation |365[ where A is replaced by the corresponding characteristic speed. 

Devising a Chebyshev penalty scheme that guarantees stability is more convoluted. In partic- 
ular, the advection equation is already not well posed in the Chebyshev norm, for example (see 



[204]). Stability in the L2 norm can be established using the Chebyshev-Legendre method |141) . 
where the Chebyshev-Gauss-Lobatto nodes are used for the approximation but the Legendre ones 
for satisfying the equation. In this approach, the penalty method is global, because it adds terms 
to the right-hand side of the equations not only at the endpoint, but at all other collocation points 
as well. 

A simpler approach, where the penalty term is only applied to the boundary collocation point, 
as in the Legendre and finite differences case, is to show stability in a different norm. For example, 



in Ref. [134] it is shown that a penalty term as in Equation (365) is stable for the Chebyshev 



Gauss-Lobatto case in the norm defined by the weight, cf. Equation (310), 



1 + x 

Ul(x) = (1 + x)uJ C heby S hev(x) = \l z , -1 < X < 1, 



if 



N 2 
r>^- (367) 

In Figure p^ we show the spectrum of an approximation of the advection equation (with speed 
A = 1 and g(t) = 0) using the Chebyshev penalty method, N = 20,40,60 collocation points and 
S := tN 2 = 0.3. There are no eigenvalues with positive real component. Even though this is only 
a necessary condition for stability, it suggests that it might be possible to lower the bound ( |367 ) 



on the minimum penalty needed for stability. On the other hand, for S = 0.2 (and lower values) 
real positive values do appear in the spectrum. Compared to the injection case (Figure El), the 
spectral radius for the cases shown in Figure [5] are about a factor of two larger. However, in most 
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applications using the method of lines the timestep is dictated by the condition of keeping the time 
integration error smaller than the spatial one, so as to maintain spectral convergence, and not the 
CFL limit. 
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Figure 5: Spectrum of the Chebyshev penalty method for the advection equation and (from left 
to right) N = 20,40,60 collocation points and S = 0.3. 



10.2 Interface boundary conditions 

Interface boundary conditions are needed when there are multiple grids in the computational 
domain, as discussed in Section [IT] below. This applies to complex geometries, when multiple 
patches are used for computational efficiency, to adapt the domain decomposition to the wave 
zone, mesh refinement, or a combination of these. 

A simple approach for exchanging the information between two or more grids is a combination 
of interpolation and extrapolation of the whole state vector at the intersection of the grids. This 
is the method of choice in mesh refinement, for example, and works well in practice. In the case of 
curvilinear grids and very high order finite difference or spectral methods, though, it is in general 
not only difficult to prove numerical stability, but even to find a scheme that exhibits stability 
from a practical point of view. 

10.2.1 Penalty conditions 

The penalty method discussed above for outer boundary conditions can also be used for multi- 
domain interface ones, including those present in complex geometries 112, 230, 115, 303, 304, 137 . 
It is simple to implement, robust, leads to stability for a very large class of problems and preserves 
the accuracy of arbitrary high order difference and spectral methods. 

Finite differences. As an example consider again an advection equation but now on the whole 
real line 



Ut — \u x , 
u(0,x) = f(x), 



-OO < X < OO, 

-oo < x < oo, 



t > 0, 



where we assume that the initial data / is C°° smooth and has compact support. The domain 
of the real line is chosen just for simplicity, to focus on the interface procedure at x = 0. In the 
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realistic case of compact domains, outer boundaries are also present and these can be treated by 
any of the methods discussed in Section fl0.1| above. 

Consider two grids, left and right, covering the intervals (— oo,0], and [0, +00), respectively: 

x\ = iAx 1 , i = 0,-1,-2,..., 

x\ = iAx\ i = 0,1,2,..., 

where the gridspacings need not agree, Ax 1 ^ Ax r , and the difference operators D l and D r , which 
are not necessarily equal to each other either, and do not even need to be of the same order of 
accuracy, satisfy SBP with respect to scalar products given by the weights a 1 , a r on their individual 

grids: 

+00 

{v\v% t =Ax l Y, ^M > (v r ,v r )j: r =Ax r J2^ v > r 3 - ( 368 ) 

For the time being assume that both SBP scalar products are diagonal, though. The SAT 
semi-discrete approximation to the problem then is 

j t v\ = XD l v\ + ^ r {v r Q -vl), i = 0,-1,-2,..., (369) 

~< = ADX+^(^-«o r ), i = 0,1,2,.... (370) 

Notice that in this approach the numerical solution at any fixed resolution is bi-valued at the 
interface boundary x — 0, and in the same spirit as the penalty approach for outer boundary 
conditions, continuity of the fields at the interface is not enforced strongly but weakly. 
Defining the energy 

E:=(v I V)i: 1 + <t> r J t; r > Sp , 



and using the approximation (369 3701 and the SBP property of the difference operators, its time 
derivative is 

j t E={\- 2S l ){u l ) 2 + (-A - 2S r )(u r Q ) 2 + 2(5' + S r )u l u r . 

Then an estimate follows if two conditions are satisfied. One of them is A + S r — Si — 0. The other 
one imposes an additional constraint on the values of Si and S r - 

• Positive X: 

Si = A + 5, S r = 5, with 5 > — — 

The estimate is 

±E = -(u l Q -u r Q ) 2 (X + 2S)<0. 

• Negative X: this is obtained from the previous case after the transformation A n- — A 

S r = —A + 5, Si = 5, with S > — 

and 

j t E = (u l -u r ) 2 (X-26)<Q. 

• Vanishing X: this can be seen as the limiting case of any of the above two, with 

l E =-(u l o -u r o f2S<0. 
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The following remarks are in order: 

• For the minimum values of S allowed by the above inequalities the energy estimate is the 
same as for the single grid case with outer boundary conditions, see Section [10.1.3[ and the 



discretization is time-stable (see Section 7.4 ) , while for larger values of S there is damping in 



the energy which is proportional to the mismatch at the interface. 

• Except for the case of the most natural choice 6 — 0, the evolution equations for outgoing 
modes also need to be penalized in a consistent way in order to derive an energy estimate. 
However, as is always the case, the lack of an energy-type estimate does not mean that the 
scheme is unstable, since the energy method provides sufficient but not always necessary 
conditions for stability. 

• The general case of symmetric hyperbolic systems follows along the same lines: a decompo- 
sition into characteristic variables is performed and the evolution equation for each of them 
is penalized as in the advection equation example. At least for diagonal norms, stability 
also follows for general linear symmetric hyperbolic systems in several dimensions. With the 
standard caveats for non-diagonal norms, the procedure follows in a similar way except that 
penalty terms are not only added to the evolution equations at the interface on each grid 
but also near them. In practice, though, applying penalties just at the interfaces appears to 
work well in practice in many situations. 

Spectral methods. The standard procedure for interface spectral methods is to penalize each 
incoming characteristic variable, exactly as in the outer boundary condition case. Namely, as in 



Equation ( 365 ) with lower bounds for the penalty strengths given by Equations ( 366 ) and ( 367 ) 
for Legendre and Chebyshev polynomials, respectively. We know from the finite difference analysis 
above, though, that in general this does not imply an energy estimate and in order to achieve 
one outgoing modes also need to be penalized, with strengths that are coupled to the penalty for 
incoming modes. However, the procedure of penalizing just incoming modes at interfaces appears 
to works well in practice, so we analyze this in some detail. 

Figure [6] shows the spectrum of the Chebyshev penalty method as described, for an advection 
equation, 

(!(=«!, —1 < X < 1, t > 0, 

where there is an interface boundary at x — 0. In more detail: the figure shows the maximum real 
component in the spectrum for N = 20 collocation points as a function of the penalty strength S. 
There are no real positive values, and this remains true for different values of N, in agreement with 
the fact that penalizing just the incoming mode works well in practice. The figure also supports 
the possibility that S > 0.3 might actually be enough for stability, as in the outer boundary case. 
The figure also shows the spectral radius as a function of the penalty strength. Beyond S = 1 
it grows very quickly, and even though as mentioned the timestep is usually determined by keeping 
the time integration error below that one due to spatial discretization, that might not be the case 
if the spectral radius is too large. Thus, it is probably good to keep S < 1. 

10.3 Going further, applications in numerical relativity 

In numerical relativity, the projection method for outer boundary conditions has been used in 
references [99j I100[ I409| I270[ 1217] . the penalty finite difference one for multi-domain boundary 
conditions in [5731 EM I373J Q32J HHSl [3J7J SH [25UJ SHI SH] > and for spectral methods in - among 
many other references - [1571 13531 I571 ITo31 12551 13571 15501 1T251 ITM I2^H IM1 1551 1251 15551 1T4T1 1572] . 
Ref. [2881 presents a comparison, for the finite difference case, of numerical boundary conditions 
through injection, orthogonal projections, and penalty terms. One additional advantage of the 
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Figure 6: Maximum real component (left) and spectral radii (right) versus penalty strength S, for 
the Chebyshev penalty method for the advection equation with two domains (N = 20 collocation 
points). 



penalty approach is that for advection-diffusion problems it damps away in time violations of the 
compatibility conditions between the initial and boundary data, see Section [5] Figure [7] shows the 
error as a function of time for an advection-diffusion equation where the initial data is perturbed 
so that an inconsistency with the boundary condition is present, and the error is defined as the 
difference between the unperturbed and perturbed solutions. See Ref. [288 for more details. One 
expects this not to be the case for hyperbolic problems, though. For example, at the continuum 
an incompatibility between the initial data and initial-boundary condition for the advection would 
propagate, not dissipate in time, and a consistent and convergent numerical scheme should reflect 
this behavior. 

Many systems in numerical relativity, starting with Einstein's equations themselves, are numer- 
ically solved by reducing them to first order systems because there is a large pool of advanced and 
mature analytical and numerical techniques for them. However, this is at the expense of enlarging 
the system and, in particular, introducing extra constraints (though this seems to be less of a con- 
cern; see, for example, [278, 751). It seems more natural to solve such equations directly in second 
order (at least in space) form. It turns out, though, that it is considerably more complicated to 
ensure stability for such systems. Trying to "integrate back" an algorithm for a first order reduc- 
tion to its original second order form is in general not possible; see, for example, Refs. 98, 4(JT] for 
a discussion of this apparent paradox and the difficulties associated with constructing boundary 
closures for second order systems such that an energy estimate follows. 

The SAT approach was generalized to a class of wave equations in second order form in space 
in Refs. [292, 294, I291|I118| . Part of the difficulty in obtaining an energy estimate for second order 
variable coefficient systems in these approaches is related to the property that the finite difference 
operators now depend on the equations being solved. This also complicates their generalization to 
arbitrary systems. For example, in Ref. [291| deals with systems of the form 



au tt = (bu x ) a 



0<x< 1, 



where a, b : [0, 1] — > K are strictly positive, smooth functions, and difference and dissipative 
operators of the desired order satisfying SBP approximating (bu x ) x are constructed. See also 
Ref. |294j for generalizations which include shift-type terms. 

In |293| the projection method and high order SBP operators approximating second derivatives 
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Figure 7: Comparison of different numerical boundary approaches for an advection-diffusion equa- 
tion where the initial data is perturbed, introducing an inconsistency with the boundary condition 
[288J . The SAT approach, besides guaranteeing time stability for general systems, "washes out" 
this inconsistency in time. "MODIFIED P" corresponds to a modified projection [220| for which 
this is solved at the expense of losing an energy estimate. Courtesy: Ken Mattsson. 

were used to provide interface boundary conditions for a wave equation (directly in second order 
in space form) in discontinuous media while guaranteeing an energy estimate. The domain in this 
work is a rectangular multi-block domain with piecewise constant coefficients, where the interfaces 
coincide with the location of the discontinuities. This approach was generalized, using the SAT 
for the jump discontinuities instead of projection, to variable coefficients and complex geometries 
in [250] . 



The difficulty is not related to finite differences: in |401j a penalty multi-domain method was 
derived for second order systems. For the Legendre and constant coefficient case the method 
guarantees an energy estimate but difficulties are reported in guaranteeing one in the variable 
coefficient case. Nevertheless, it appears to work well in practice in the variable coefficient case 
as well. An interesting aspect of the approach of |401j is that an energy estimate is obtained by 
applying the penalty in the whole domain (as an analogy we recall the above discussion about the 



Legendre-Chebyshev penalty method in Section 10.1.3) 



A recent generalization, valid both for finite differences and - at least Legendre - collocation 
methods (as discussed, the underlying tool is the same: Summation by Parts), to more general 
penalty couplings, where the penalty terms are not scalar but matrices (i.e., there is coupling 
between the penalty for different characteristic variables) can be found in [116J . 

Energy estimates are in general lost when the different grids are not conforming (different types 
of domain decompositions are discussed in the next section) , and interpolation is needed. This is 
the case when using overlapping patches with curvilinear coordinates but also mesh refinement with 
Cartesian, nested boxes (see, for example, Ref. 269 and references therein). A recent promising 
development has been the introduction of a procedure for systematically constructing interpolation 
operators preserving the Summation by Parts property for arbitrary high order cases, see Ref. 289 . 



Numerical tests are presented with a 2:1 refinement ratio, where the design convergence rate is 
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observed. It is not clear whether reflections at refinement boundaries such as those reported in 
Ref. [37| would still need to be addressed or they would be taken care of by the high order accuracy. 

10.3.1 Absorbing boundary conditions 

Finally, we mention some results in numerical relativity concerning absorbing artificial boundaries. 
In Ref. |306j , boundary conditions based on the work of Ref. [45] , which are perfectly absorbing for 
quadrupolar solutions of the flat wave equation, were numerically implemented via spectral meth- 
ods, and proposed to be used in a constrained evolution scheme of Einstein's field equations !63. 
For a different method which provides exact, nonlocal outer boundary conditions for linearized 



gravitational waves propagating on a Schwarzschild background, sec [263, 2621 1264) . A numeri- 
cal implementation of the well-posed harmonic IBVP with Sommerfeld-type boundary conditions 
given in Ref. |260j was worked out in [55] , where the accuracy of the boundary conditions was also 
tested. 

In Ref. |354) . various boundary treatments for the Einstein equations were compared to each 
other using the test problem of a Schwarzschild black hole perturbed by an outgoing gravitational 
wave. The solutions from different boundary algorithms were compared to a reference numerical 
solution obtained by placing the outer boundary at a distance large enough to be causally discon- 
nected from the interior spacetime region where the comparison was performed. The numerical 
implementation in 354 was based on the harmonic formulation described in |278j . 

In Figure pi a comparison is shown between (a) simple boundary conditions which just freeze 
the incoming characteristic fields to their initial value, and (b) constraint-preserving boundary 
conditions controlling the complex Weyl scalar >Fo at the boundary. The boundary surface is an 
approximate metric sphere of areal radius R — 41.9 M, with M the mass of the black hole. The 
left side of the figure demonstrates that case (a) leads to significantly larger reflections than case 
(b). The difference with the reference solution after the first reflection at the boundary is not 
only large in case (a), but also it does not decrease with increasing resolution. Furthermore, the 
violations of the constraints shown in the right side of the figure do not converge away in case (a) , 
indicating that one does not obtain a solution to Einstein's field equations in the continuum limit. 
In contrast to this, the difference with the reference solution and the constraint violations both 
decrease with increasing resolution in case (b). 

A similar comparison was performed for alternative boundary conditions, including spatial 
compactifications and sponge layers. The errors in the gravitational waves were also estimated 
in [354], by computing the complex Weyl scalar ^4 for the different boundary treatments, see 
Figure 10 in that reference. 

Based on the construction of exact in- and outgoing solutions to the linearized Bianchi identities 



on a Minkowksi background (cf. Example 32 1, the reflection coefficient 7 for spurious reflections 
at a spherical boundary of areal radius R which sets the Weyl scalar ^0 to zero was estimated to 
be [85] 

\ 7 (kR)\ « \{kR)-\ (371) 

for outgoing quadrupolar gravitational radiation with wave number k ^> Rr 1 . Figure [9] shows the 
Weyl scalars >Fo and "J 4 computed for the boundary conditions in case (b) and extracted at 1.9 M 
inside the outer boundary. By computing their Fourier transform in time, the overall dependence 
of the ratio agrees very well with the predicted reflection coefficient. 

For higher-order absorbing boundary conditions, which involve derivatives of the Weyl scalar 
^0, see Ref. [357] . and Ref. |353] for their numerical implementation. 
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Figure 8: Comparison between boundary conditions in case (a) (solid) and case (b) (dotted); see 
the body of the text for more details. Four different resolutions are shown: (N r ,L) = (21,8), 
(31,10), (41,12) and (51,14), where N r and L refer to the number of collocation points in the 
radial and angular directions, where Chebyshev and spherical harmonics are used, respectively. 
Left panel: the difference MA between the solution with outer boundary at R = 41.9 M and the 
reference solution. Right panel: the constraint violation C (see Ref. |354j for precise definitions of 
these quantities and further details). Courtesy: Oliver Rhine. 
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Figure 9: Comparison of the time Fourier transform of \&o and ^4 for two different radii of the 
outer boundary. The leveling off of \&q for kM > 3 is due to numerical roundoff effects (note the 
magnitude of ^q at those frequencies). Courtesy: Oliver Rinne. 
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11 Domain Decomposition 

Most three-dimensional codes solving the Einstein equations currently use several non-uniform 
grids/numerical domains. Adaptive mesh refinement (AMR) a la Berger-Oliger [46] . where the 
computational domain is covered with a set of nested grids, usually taken to be Cartesian ones, 
is used by many efforts. See, for instance, Refs. [3741 [3271 [3821 ESS1 UM EH EEU ESI (821 IT061 
I418[ 429, 426. I153[l314j ). Other approaches use multiple patches with curvilinear coordinates, or 
a combination of both. Typical simulations of Einstein's equations do not fall into the category of 
complex geometries and usually require a fairly "simple" domain decomposition (in comparison to 
fully unstructured approaches in other fields). 

Below we give a brief overview of some domain decomposition approaches. Our discussion is 
far from exhaustive, and only a few representative samples from the rich variety of efforts are 
mentioned. In the context of Cauchy evolutions, the use of multiple patches in numerical relativity 
was first advocated and pursued by Thornburg [4051 406 . 

11.1 The power and need of adaptivity 

Physical problems governed by nonlinear equations can develop small scale structures which are 
rarely easy to predict. The canonical example in hydrodynamics is turbulence, which results in 
short wavelength features up to the viscous scale. An example in General Relativity of arbitrarily 
small scales is that one of critical phenomena |122[ 1215] , where the solution develops a self-similar 
behavior revealing a universal approach to a singular one describing a naked singularity. Uncovering 
this phenomena crucially required to dynamically adjust the grid structure to respond to the 
(exponentially) ever-shrinking features of the solution. Recently, such need was also demonstrated 



quite clearly by the resolution of the final fate of unstable black strings [2711 1272] . This work 
followed the dynamics, in five dimensional spacetimes, of an unstable black string: a black hole 
with topology S 2 x S 1 , with the asymptotic length of S 1 over the black hole mass per unit length 
above the critical value for linearized stability [2101 I211J . As the evolution unfolds, pieces of 
the string shrink and elongate (so that the area increases), yielding another unstable stage, see 



Figure 10 This behavior repeats in a self-similar manner, the black string developing a fractal 
structure of thin black strings joining spherical black holes. This behavior was followed through 
four generations and the numerical grid refined in some regions up to a factor of 2 17 compared 
to the initial one. This allowed the authors to extrapolate the observed behavior and conclude 
that the spacetime will develop naked singularities in finite time from generic conditions, thereby 
providing a counterexample to the cosmic censorship conjecture in five dimensions. 

11.2 Adaptive mesh refinement for BBH in higher dimensional gravity 

In |426j the authors numerically evolved binary black holes "in a box" by imposing reflecting outer 
boundary conditions which mimic the Anti-de Sitter spacetime. These conditions are imposed 
on all the fields and the outer boundary is taken to have spherical shape, which is approximated 
by a "Lego" sphere. Figure [IT] on the left illustrates a Lego-sphere around a black hole binary, 
mimicking an asymptotically Anti-de Sitter spacetime. A computational domain is schematically 
displayed using four refinement levels with one or two components each. The individual com- 
ponents are labeled G z m , where the indices i and m denote the refinement level and component 
number, respectively. At the spherical boundary, marked by X, reflecting boundary conditions 
were imposed. The right figure shows mesh refinement around two black holes, with the apparent 
horizons represented by the white grid. Further extensions of this research program to numerically 
study black holes in higher dimensional gravity can be found in [42 8 L 1425] . 
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Figure 10: Early (left) and late (right) stages of the apparent horizon describing the evolution of 
an unstable black string. Courtesy: Luis Lehner and Frans Pretorius. 
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Figure 11: Left: Lego-sphere around a black hole binary. Right: Mesh refinement around two 
black holes. Courtesy: Vitor Cardoso, Ulrich Sperhake and Helvi Witek. 
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11.3 Adaptive mesh refinement and curvilinear grids 

In Rcfs. 322, 323 the authors introduced an approach which combines the advantages of adaptive 
mesh refinement near the "sources" (say, black holes) with curvilinear coordinates adapted to 
the wave zone, see Figure |12| The patches are communicated using polynomial interpolation in 
its Lagrange form, as explained in Section |8.1[ and centered stencils are used, both for finite 
differencing and interpolation. Up to eighth order finite differencing is used, with an observed 
convergence rate between six and eight in the (t = 2 = m) modes of the computed gravitational 
waves (parts of the scheme have of lower order convergence rate, but they do not appear to 



dominate). Presently, the BSSN formulation of the Einstein's equations as described in Section 4.3 
is used directly in its second order in space form, with outgoing boundary conditions for all the 
fields. The implementation is generic and flexible enough to allow for other systems of equations, 
though. As in most approaches using curvilinear coordinates in numerical relativity, the field 
variables are expressed in a global coordinate frame. This might sound unnatural and against the 
idea of using local patches and coordinates. However, it simplifies dramatically any implementation. 
It is also particularly important when taking into account that most formulations of Einstein's 
equations and coordinate conditions used are not covariant. 

This hybrid approach has been used in several applications. In particular, for validating ex- 
trapolation procedures of gravitational waves extracted from numerical simulations at finite radii 
to large distances from the "sources" |322j . Since the outermost grid structure is well adapted 
to the wave zone the outer boundary can be located at large distances with only linear cost on 
its location. Other applications include Cauchy-Characteristic extraction (CCE) of gravitational 
waves 13381 151] , a waveform hybrid development |359j , and studies of memory effect in gravitational 
waves [321] . The accuracy necessary to study small memory effects is enabled both by the grid 
structure - being able to locate the outer boundary far away - and CCE. 

11.4 Spectral multi-domain binary black hole evolutions 

In current binary black hole evolutions using spectral collocation methods there are typically three 
sets of spherical shells, one around each black hole and one in the wave zone. These three shells 
are connected by subdomains of various shapes and sizes. Figure [13] shows the global structure 
of one such grid-structure, emphasizing the spherical patches in the wave zone and how the dif- 
ferent domains are connected. Inter-domain boundary conditions are set by the spectral penalty 



method described in Section 10 The adaptivity provided by domain decomposition in addition 
to the spectral convergence rate has lead to the highest accuracy binary black hole simulations 
to date. Currently these evolutions use a first order symmetry hyperbolic reduction of the har- 



monic system with constraint damping as derived in |278j and summarized in Section 4.1 with 
constraint-preserving boundary conditions as designed in [2781 13541 1351) . The field variables are 
expressed in an "inertial" Cartesian coordinate system which is related to one fixed to the compu- 
tational domain through a dynamically defined coordinate transformation tracking the black holes 
(the "dual frame" method) [372] . 

Simulating non-vacuum systems such as relativistic hydrodynamical ones using spectral meth- 
ods can be problematic, particularly when surfaces, shocks, or other non-smooth behavior appears 
in the fluid. Without further processing the fast convergence is lost, and Gibbs' oscillations can 
destabilize the simulation. A method that has been successfully used to overcome this in general 
relativistic hydrodynamics is evolving the spacetime metric and the fluid on two different grids, 
each using different numerical techniques. The spacetime is evolved spectrally, while the fluid is 
evolved using standard finite difference/finite volume shock-capturing techniques on a separate 
uniform grid. The first code adopting this approach was described in Ref. |139j . which is a stellar 
collapse code assuming a conformally flat three-metric, with the resulting elliptic equations being 
solved spectrally. The two-grid approach was adopted for full numerical relativity simulations of 
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Figure 12: Combining adaptive mesh refinement with curvilinear grids adapted to the wave zone. 
Courtesy: Denis Pollncy. 
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Figure 13: Sample domain decomposition used in spectral evolutions of binary black holes. The 
bottom plot illustrates how the coordinate shape of the excision domain is kept proportional to 
the coordinate shape of the black holes themselves. Courtesy: Bela Szilagyi. 



black hole-neutron star binaries in Refs. J147L 11461 I163J . The main advantage of this method when 
applied to binary systems is that at any given time the fluid evolution grid only needs to extend 
as far as the neutron star matter. During the pre-merger phase, then, this grid can be chosen to 
be a small box around the neutron star, achieving very high resolution for the fluid evolution at 
low computational cost. More recently, in Ref. ^ 163] an automated re-gridder was added, so that 
the fluid grid automatically adjusts itself at discrete times to accommodate expansion or contrac- 
tion of the nuclear matter. The main disadvantage of the two-grid method is the large amount of 
interpolation required for the two grids to communicate with each other. Straightforward spec- 
tral interpolation would be prohibitively expensive, but a combination of spectral refinement and 
polynomial interpolation |67j reduces the cost to about 20-30 percent of the simulation time. 



11.5 Multi-domain studies of accretion disks around black holes 

The freedom to choose arbitrary (smooth) coordinate transformations allows the design of sophis- 
ticated problem-fitted meshes to address a number of practical issues. In [250J the authors used 
a hybrid multiblock approach for a general relativistic hydrodynamics code developed in [443] to 
study instabilities in accretion disks around black holes in the context of gamma-ray burst central 
engines. They evolved the spacetime metric using the first order form of the generalized har- 



monic formulation of the Einstein equations (see Section 4.1| on conforming grids, while using a 



high resolution shock capturing scheme for relativistic fluids on the same grid but with additional 
overlapping boundary zones (see Ref. |443) for details on the method). The metric differentiation 



was performed using the optimized -Ds-4 finite difference operators satisfying the Summation by 
Parts property, as described in Section [8. 3| The authors made extensive use of adapted curvilinear 
coordinates in order to achieve desired resolutions in different parts of the domain and to make the 
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coordinate lines conform to the shape of the solution. Maximal dissipative boundary conditions as 
defined in Section [5T2] were applied to the incoming fields, and inter-domain boundary conditions 
for the metric were implemented using the finite-difference version of the penalty method described 
in Section fTUl 

Figure [M] shows examples of the type of mesh adaptation used. The top left panel shows the 
meridional cut of an accretion disk on a uniform multiblock mesh (model C in |250| ). The top 
right panel gives an example of a mesh with adapted radial coordinate lines which resolves the disk 
more accurately than the mesh with uniform grid resolution (see [250] for details on the particular 
coordinate transformations used to obtain such grid). A 3D view of such multi-domain mesh at 
large radii is shown on the bottom left panel of Figure |14| In the area near the inner radius where 
the central black hole is located, the resolution is high enough to accurately resolve the shape and 
the dynamics of the black hole horizon. Finally, near the disk the resolution across the disk in 
both radial and angular directions is made approximately equal and sufficiently high to resolve the 



transverse disk dynamics. The bottom right panel of Figure 14 shows the 3D view of the adapted 
mesh in the vicinity of the disk. 




Figure 14: Domain decomposition used in evolutions of accretion disks around black holes 250 , 
see the text for more details. Courtesy: Oleg Korobkin. 
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11.6 Finite difference multi-block orbiting binary black holes simula- 
tions 

In [317J orbiting binary black hole simulations using a high order Unite difference multi-domain 



approach were presented. The basic layout of the full domain used is shown in Figures 15 and 16 
where the centers of the excised black hole spheres are located along the x axis at x = ±a. The 
computational domain is based on two types of building blocks, shown in Figure |17[ which are 
essentially variations of cubed spheres. In the simulations of [317 the optimized Z?8_4 difference 
operator described in Section |8.3| was used, the patches were communicated using the penalty 



technique (see Section 10 1, and the resulting equations were evolved in time using an embedded 
fourth-fifth order Runge-Kutta stepper (see Section 7.5). The formulations of the equations used, 



boundary conditions and dual frame technique are exactly those of the spectral evolutions discussed 
above, only the numerical and domain decomposition approaches differ, which follow building work 
from [2731 11381 13731 11421 1316] . Strong and weak scaling is observed up to at least several thousand 
processors. 
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Figure 15: Equatorial cut of the computational domain used in multi-block simulations of orbiting 
binary black holes (left). Schematic figure showing the direction considered as radial (red arrows) 
for the cuboidal blocks (right). 




Figure 16: Multi-block domain decomposition for a binary black hole simulation. 
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Figure 17: Equatorial cross-section of varations of cubed sphere patches. 
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